Using Padding for ASR models

Miraz1993 · December 16, 2022, 5:56pm

Hi! I have recently started using ASR models and found huggingface to be very helpful. However, I have a query related to the Wav2Vec2 processor. I found that the generated output is padded but I can not find the position of eos_token in the generated text ids. Is the token not used if we use padding?

I have used the WavLM and executed the demo code provided in the link: WavLM

Topic		Replies	Views
How to set the padding configuration with Huggingface's GenerateMixin's generate method? Intermediate	7	11153	September 26, 2023
Wav2vec - <s></s> tokens Models	0	306	January 18, 2022
Should the padding token be ignored in the loss function? 🤗Transformers	0	1274	August 24, 2021
Seeking an end-to-end example of grouping, tokenization and padding to construct preprocessed data in HF 🤗Tokenizers	0	391	June 26, 2023
Question about Wav2vec2 Models	1	544	May 6, 2022

Using Padding for ASR models

Related topics