No output from ASR Pipeline using Whisper

Hello all,

I am attempting to use Whisper to transcribe long-form audio recordings with pipeline(). I followed the brief tutorial on the openai/whisper-small.en model page under the “Long-form Transcription” section.

When I run the pipeline, it returns {'text': ''} with no error message or other context. I have tried passing both the path to the file (str) and the loaded audio’s waveform (array), and get the same result.
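For the array input, note that the ASR pipeline assumes the waveform is already at 16 kHz unless you also pass the sampling rate; a mismatched rate can produce empty or garbage transcripts with no error. This is a minimal sketch of the dict form the pipeline accepts (the silence array is just a stand-in for real audio):

```python
import numpy as np

# Stand-in waveform: one second of silence at Whisper's expected 16 kHz.
# In practice this would come from e.g. librosa.load(file_path, sr=16000).
sr = 16000
waveform = np.zeros(sr, dtype=np.float32)

# A bare array is assumed to already be 16 kHz; the dict form carries the
# true sampling rate so the pipeline can resample if needed.
sample = {"raw": waveform, "sampling_rate": sr}
# result = pipe(sample)
```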

I also tried loading a model and processor directly, calling model.generate(), and then decoding; the output comes back as an empty string, as above.
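For reference, this is roughly the manual path I mean (sketched here against the hub id rather than my local directory, with a silence array standing in for the real audio). One caveat worth noting: the feature extractor pads/truncates to a single 30-second window, so this path only ever transcribes the first 30 seconds of a long recording:

```python
import numpy as np
import torch
from transformers import WhisperForConditionalGeneration, WhisperProcessor

# Load model and processor from the hub (or a local dir with the same files).
processor = WhisperProcessor.from_pretrained("openai/whisper-small.en")
model = WhisperForConditionalGeneration.from_pretrained("openai/whisper-small.en")

# Stand-in waveform: one second of silence at 16 kHz.
waveform = np.zeros(16000, dtype=np.float32)

# The feature extractor pads/truncates to one 30 s log-mel window,
# so long-form audio needs the pipeline's chunking on top of this.
inputs = processor(waveform, sampling_rate=16000, return_tensors="pt")
with torch.no_grad():
    predicted_ids = model.generate(inputs.input_features)
text = processor.batch_decode(predicted_ids, skip_special_tokens=True)[0]
```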

I’m not sure what is causing the empty string. The audio files themselves are fine: I have successfully transcribed them with OpenAI’s Whisper library.

This is the code I am running:

import torch
import transformers as tf
from transformers import pipeline

DEVICE = 0 if torch.cuda.is_available() else -1  # device index must be an int, not '0'
file_path = 'path/to/audio_file.wav'
model_path = '/path/to/model_dir'  # contains all the same files as the openai/whisper-small.en repo

model = tf.WhisperForConditionalGeneration.from_pretrained(model_path)
tokenizer = tf.WhisperTokenizerFast.from_pretrained(model_path)
extractor = tf.WhisperFeatureExtractor.from_pretrained(model_path)

pipe = pipeline(
    task='automatic-speech-recognition',
    model=model,
    tokenizer=tokenizer,
    feature_extractor=extractor,  # the keyword is feature_extractor, not extractor
    device=DEVICE,
)

result = pipe(file_path)
print(result)

Output:

{'text': ''}

Update: I resolved the issue. It turned out one of the downloaded config files was bad.
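For anyone hitting the same thing: one quick way to rule out a stale or corrupted local copy is to force a fresh download of the suspect file from the hub with huggingface_hub (shown here for config.json; the same works for generation_config.json etc.):

```python
from huggingface_hub import hf_hub_download

# force_download=True bypasses any cached (possibly corrupted) copy
# and fetches the file fresh from the openai/whisper-small.en repo.
config_path = hf_hub_download(
    "openai/whisper-small.en",
    "config.json",
    force_download=True,
)
print(config_path)
```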