I have the following script: import torch from transformers import AutoModelForSpeechSeq2Seq, AutoProcessor, pipeline from datasets import load_dataset import time # Get free TF32 performance increase if the GPU supports it torch.backends.cuda.matmul.allow_tf32 = True torch.backends.cudnn.allow_tf…

Performing Whisper's "transcribe" with Transformer pipelines

panigrah December 19, 2023, 2:20am 2

For your side quest: add a torch.set_default_device(device) call before loading the model. The message is harmless anyway, since you move the model to a device after creating it.
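A minimal sketch of that ordering, assuming PyTorch 2.0 or later (where torch.set_default_device exists). The small torch.nn.Linear layer is only a stand-in so the snippet runs without downloading anything; in the original script the AutoModelForSpeechSeq2Seq.from_pretrained(...) call would sit in the same position.

```python
import torch

# Pick the GPU if one is available, otherwise fall back to the CPU.
device = "cuda:0" if torch.cuda.is_available() else "cpu"

# Set the default device BEFORE any model is created, so new
# parameters are allocated directly on that device.
torch.set_default_device(device)

# Stand-in for the real model load (hypothetical placeholder).
model = torch.nn.Linear(4, 2)

# The parameters already live on the chosen device.
print(next(model.parameters()).device)
```

With the default set first, a later model.to(device) becomes a no-op rather than a copy.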

For additional tokens: I believe you have to train (fine-tune) the model, because adding tokens causes the model to lose some of its weights.
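One hedged way to picture why training is needed after adding tokens: in the plain PyTorch sketch below (an illustration of growing an embedding table by hand, not the transformers API, with hypothetical sizes), the rows for the new tokens start from a fresh random initialization, so the model has learned nothing about them until it is fine-tuned.

```python
import torch

torch.manual_seed(0)

# Hypothetical sizes chosen only for illustration.
old_vocab, new_tokens, dim = 10, 2, 4

# Stand-in for an already trained embedding table.
old_emb = torch.nn.Embedding(old_vocab, dim)

# Larger table with room for the added tokens.
new_emb = torch.nn.Embedding(old_vocab + new_tokens, dim)

# Copy the existing rows across; the extra rows keep their random init.
with torch.no_grad():
    new_emb.weight[:old_vocab].copy_(old_emb.weight)

copied_ok = torch.equal(new_emb.weight[:old_vocab], old_emb.weight)

# These rows carry no learned information yet.
print("rows for new tokens:", new_emb.weight[old_vocab:])
```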
