Hi @alerio,
I had the same question, and it turns out that WhisperForCausalLM
is used solely to load the assistant (draft) model for speculative decoding.
Instead of loading the whole encoder-decoder, WhisperForCausalLM
loads only the decoder with a language modeling head on top.
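As an illustration (not taken from the PR itself), here's a minimal sketch of how a decoder-only assistant can be passed to `generate()` for assisted/speculative decoding; the checkpoint names and the dummy dataset are just examples:

```python
from datasets import load_dataset
from transformers import (
    WhisperForCausalLM,
    WhisperForConditionalGeneration,
    WhisperProcessor,
)

# Main encoder-decoder model (checkpoint names here are just examples)
processor = WhisperProcessor.from_pretrained("openai/whisper-large-v2")
model = WhisperForConditionalGeneration.from_pretrained("openai/whisper-large-v2")

# Assistant model: WhisperForCausalLM loads only the decoder + LM head
assistant_model = WhisperForCausalLM.from_pretrained("distil-whisper/distil-large-v2")

# Small dummy audio sample for demonstration
ds = load_dataset("hf-internal-testing/librispeech_asr_dummy", "clean", split="validation")
sample = ds[0]["audio"]
input_features = processor(
    sample["array"], sampling_rate=sample["sampling_rate"], return_tensors="pt"
).input_features

# Passing assistant_model enables assisted (speculative) decoding
predicted_ids = model.generate(input_features, assistant_model=assistant_model)
print(processor.batch_decode(predicted_ids, skip_special_tokens=True))
```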
You can see more details in the initial PR from Patrick: [WhisperForCausalLM] Add WhisperForCausalLM for speculative decoding by patrickvonplaten · Pull Request #27195 · huggingface/transformers · GitHub