Performing Whisper's "transcribe" with Transformers pipelines

For your side quest: add a torch.set_default_device(device) call before loading the model. The message is harmless anyway, since you are moving the model to a device after creating it.
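A minimal sketch of that ordering, in case it helps. It assumes PyTorch 2.0 or newer (where torch.set_default_device exists) and that the model is loaded with AutoModelForSpeechSeq2Seq; the checkpoint name openai/whisper-small and the helper names are placeholders of mine, not taken from your script.

```python
import torch


def pick_device() -> str:
    """Return "cuda" when PyTorch can see a GPU, otherwise "cpu"."""
    return "cuda" if torch.cuda.is_available() else "cpu"


def load_whisper(model_id: str = "openai/whisper-small"):
    """Load a Whisper checkpoint with the default device set beforehand.

    model_id is an assumed placeholder; substitute your own checkpoint.
    """
    # Imported here so that merely defining the helper does not pull in
    # the Transformers library or trigger a download.
    from transformers import AutoModelForSpeechSeq2Seq

    device = pick_device()
    # Set the default device BEFORE the model is created, so that new
    # tensors are allocated on that device from the start.
    torch.set_default_device(device)
    model = AutoModelForSpeechSeq2Seq.from_pretrained(model_id)
    # The explicit move from the original script is kept as a safeguard.
    return model.to(device), device
```

Note that torch.set_default_device changes a global setting, so every tensor created afterwards without an explicit device lands on that device as well.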

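For additional tokens: I believe you have to train or fine-tune. Adding tokens enlarges the embedding matrix, and the rows for the new tokens are freshly initialized rather than learned, so the model cannot use them meaningfully until they have been trained.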
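Roughly, the step before fine-tuning could look like the sketch below. It relies on the tokenizer's add_tokens method and the model's resize_token_embeddings method from Transformers; the helper name add_tokens_and_resize is hypothetical and only there for illustration.

```python
def add_tokens_and_resize(model, tokenizer, new_tokens):
    """Register new tokens and grow the model's embeddings to match.

    Returns the number of tokens that were actually added; tokens already
    in the vocabulary are skipped by the tokenizer.
    """
    num_added = tokenizer.add_tokens(new_tokens)
    if num_added > 0:
        # Existing embedding rows are kept; only the rows for the new
        # tokens are freshly initialized and still need training.
        model.resize_token_embeddings(len(tokenizer))
    return num_added
```

With a Whisper model and its tokenizer already loaded, this would be called as add_tokens_and_resize(model, tokenizer, ["<my_token>"]), where "<my_token>" is a made-up example. The model still needs fine-tuning on data that contains the new tokens afterwards.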