Finetuned whisper model translating instead of transcribing

saimadhavang · September 9, 2023, 2:24pm

I have finetuned the whisper small model to my custom dataset in the Kannada language and got a good WER while training it, thanks to this great blog.
However, when I try to run inference on it using pipeline, it is translating the audio into English rather than transcribing it to Kannada. Here is a jist of what I am doing in my code:

processor = WhisperProcessor.from_pretrained("openai/whisper-small", language="Kannada", task="transcribe")
tokenizer = processor.tokenizer
feature_extractor = processor.feature_extractor
model = WhisperForConditionalGeneration.from_pretrained('<repo of my finetuned model>', use_auth_token=True)
pipe = pipeline(task = 'automatic-speech-recognition', model=model, tokenizer=tokenizer, feature_extractor=feature_extractor, device=0)
dataset = load_dataset('<my dataset>', split='test')
transcriptions = {'y':[], 'pred':[]}
def process_clip(clip):
    trans['y'].append(clip['sentence'])
    trans['pred'].append(pipe(clip['audio'])['text'])
for clip in tqdm(dataset):
  process_clip(clip)

Any help will be appreciated, thanks for taking the time!!!

cociweb · December 30, 2023, 8:45pm

Were you able to find the solution? Is it only a HF symptome or this problem is valid on other api environment as well? The problem exists with small model only?

saimadhavang · December 31, 2023, 7:04am

I wasn’t able to find a solution. This issue was not occurring with the medium model. I haven’t experimented with other API environments.

Topic		Replies	Views
Korean finetuning on Whisper Beginners	1	1605	February 25, 2024
Openai Whisper Finetune checkpoint in local directory Beginners	0	265	March 21, 2024
Unable to run whisper small finetune after training Beginners	2	93	November 30, 2024
Fine Tuning Whisper on my own Dataset with a customized Tokenizer Beginners	16	12404	February 12, 2024
How to set language in Whisper pipeline for audio transcription? 🤗Transformers	2	8917	June 22, 2023

Finetuned whisper model translating instead of transcribing

Related topics