Wav2vec model train from scratch

Razvanip · December 28, 2021, 9:02am

Hi,

I’m new to the field of automatic speech recognition. I have a research project where we try to make a speech to text translator for Romanian medics. I saw that there are many pre-trained models for different languages which people seem to fine-tune them.

I wanted to know if it’s possible to train wav2vec for a specific language from scratch. If the answer is yes, could somebody give me an example for one language?

flozi00 · December 31, 2021, 1:45am

You can pretrain it using this Script transformers/run_wav2vec2_pretraining_no_trainer.py at master · huggingface/transformers · GitHub

but except it could be really unstable to pretrain from scratch as it’s written in the readme

Topic		Replies	Views
Need help on wav2vec 2.0 models training 🤗Transformers	2	929	July 31, 2021
Is there any tutorial 4 pretrain wav2vec2 Beginners	4	982	November 6, 2024
Pretrained wav2vec2 speech to text - decoded text is gibberish Models	0	402	June 12, 2023
Phoneme Recognition Model 🤗Transformers	1	381	September 25, 2021
Further train a fine tuned wav2vec model 🤗Transformers	2	531	September 25, 2022

Wav2vec model train from scratch

Related topics