ASR help with sequence of words

Davide85 · March 20, 2023, 11:58am

Hi there,

by following this tutorial:

I have fine-tuned the model “facebook/wav2vec2-base” on my custom dataset, which contains single words uttered by people with atypical speech. I observed high word recognition accuracy (greater than 95%). Now I would like to use the same dataset to recognize short sequences of words. For example, if I have trained the model on the keywords “volume” and “up”, I would like to recognize the sequence “volume up” within a speech recording. Is this possible? Any ideas on how to achieve this with Transformers?

Thanks in advance,

Davide

fydhfzh · March 25, 2024, 7:00pm

Have you found an answer? I have the same problem here.
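
One possible direction, offered only as a sketch and not as something confirmed in this thread: if the speaker pauses between words, the recording can be cut at the silences and each piece passed to the single-word model that already works. The code below assumes exactly that. The function names, the 20 ms frame size, the energy threshold, and the 200 ms minimum pause are illustrative guesses that would need tuning on real recordings, and `recognize_word` is a hypothetical callback standing in for whatever wraps the fine-tuned model.

```python
from typing import Callable, List, Sequence, Tuple


def split_on_silence(
    samples: Sequence[float],
    sample_rate: int,
    frame_ms: int = 20,               # assumed frame size
    energy_threshold: float = 1e-3,   # assumed threshold, tune per microphone
    min_silence_ms: int = 200,        # assumed minimum pause between words
) -> List[Tuple[int, int]]:
    """Return (start, end) sample indices of the non-silent regions."""
    frame_len = max(1, int(sample_rate * frame_ms / 1000))
    min_silence_frames = max(1, min_silence_ms // frame_ms)
    n_frames = (len(samples) + frame_len - 1) // frame_len

    segments: List[Tuple[int, int]] = []
    start = None      # sample index where the current voiced region began
    silent_run = 0    # consecutive silent frames seen inside a voiced region

    for i in range(n_frames):
        frame = samples[i * frame_len:(i + 1) * frame_len]
        energy = sum(x * x for x in frame) / len(frame)
        if energy >= energy_threshold:
            if start is None:
                start = i * frame_len
            silent_run = 0
        elif start is not None:
            silent_run += 1
            if silent_run >= min_silence_frames:
                # Close the region at the end of the last voiced frame.
                segments.append((start, (i - silent_run + 1) * frame_len))
                start = None
                silent_run = 0

    if start is not None:
        end = min(len(samples), (n_frames - silent_run) * frame_len)
        segments.append((start, end))
    return segments


def recognize_sequence(
    samples: Sequence[float],
    sample_rate: int,
    recognize_word: Callable[[Sequence[float]], str],  # hypothetical model wrapper
) -> str:
    """Run the single-word recognizer on each voiced segment and join the results."""
    segments = split_on_silence(samples, sample_rate)
    return " ".join(recognize_word(samples[s:e]) for s, e in segments)
```

With a callback that feeds each segment to the fine-tuned model and returns its predicted word, `recognize_sequence(audio, 16000, recognize_word)` would return a string such as "volume up" when the two words are separated by a clear pause. This does not cover words spoken without a pause; that case would likely need training data containing word sequences.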
