Whisper large v3 Finetune result is all lowercased

hafiz-tribetech · February 11, 2025, 10:57am

I have trained Whisper large-v3 model on my dataset of around 5-6 hours. The results are really good. But i noticed that the text is all lowercased and without punctuation. Is there any parameter that i missed during finetuning or during inference ?
I used this https://huggingface.co/blog/fine-tune-whisper blog to finetune on my dataset.

John6666 · February 11, 2025, 12:48pm

I found a similar case.

hafiz-tribetech · February 11, 2025, 3:23pm

Thank you for replying. But in my case normal whisper-large-v3 works perfectly and the transcript is always with punctuation. But when i fine tune it only then i get the results in lower case. I think may be because i fine tune it with the dataset that is lowercased and normalized thats why the resulting transcript is also lowercased.

Topic		Replies	Views
Finetuned whisper model translating instead of transcribing 🤗Transformers	2	734	December 31, 2023
Whisper fine tuning on custom audio data Beginners	4	2723	February 15, 2025
Whisper finetune for multilingual tasks Beginners	0	175	February 21, 2024
Whisper: padding issues while transcribing Beginners	0	58	January 1, 2025
Problem with finetuning model whisper Beginners	0	87	November 7, 2024

Whisper large v3 Finetune result is all lowercased

Related topics