Why is Wav2Vec pretraining loss not decreasing?

Hi there everyone :hugs:

I’m currently trying to pre-train a Wav2Vec base model. During the pre-training phase, the loss starts off around 4, decreases, and then shoots up to 6.658 and stays there. The accuracy is also low and does not increase. My learning rate is set to 0.005; I started with a learning rate of 0.0001 and increased it gradually when I saw these results. I use the English Wav2Vec model for weight initialisation. I thought it would improve if I waited longer, but it stays the same even after 20 epochs. Can anyone please share some advice on what I could do to avoid this and improve the training?
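For reference, my optimizer and schedule are set up roughly like this (a minimal sketch assuming the usual AdamW + linear-warmup recipe from the Transformers examples; the peak learning rate is the one mentioned above, and the warmup/total step counts are illustrative):

import torch
from transformers import Wav2Vec2ForPreTraining, get_linear_schedule_with_warmup

# Initialise from the English base model, as described above.
model = Wav2Vec2ForPreTraining.from_pretrained("facebook/wav2vec2-base")

# Peak learning rate from the post; warmup/total steps are illustrative assumptions.
optimizer = torch.optim.AdamW(model.parameters(), lr=0.005, betas=(0.9, 0.98), eps=1e-6)
scheduler = get_linear_schedule_with_warmup(
    optimizer, num_warmup_steps=32000, num_training_steps=200000
)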

Your assistance will be much appreciated! :hugs:


Hi ZaNi,
Have you found the reason for this behaviour? I’m facing the same problem pre-training my model from the English base model.

@ZaNi @KhusainovAidar Hi, I have the same problem. Is there any solution?

Hi @patrickvonplaten

Thank you for your time!
I followed this pretraining script, but the loss is not decreasing.

Could you give us some advice? Thank you!

Andy

@AndySun I haven’t found it. The strange thing is that after restarting training from a checkpoint (for which the loss was already near zero), it shows more realistic loss values for the first few steps and then drops to 0.0003 again. So for now I’ve just trained wav2vec2 with fairseq and converted the resulting model to TorchScript.

@KhusainovAidar Okay. Thanks for your information!

@KhusainovAidar Hi Bro, could you tell us how to convert the fairseq pre-trained model to the Transformers wav2vec2 format?

Thank you!

There is no such option, at least I haven’t found one. I’m using this repo to convert the fairseq model to TorchScript and run it in ‘server’ mode: audio/examples/libtorchaudio/speech_recognition at master · pytorch/audio · GitHub
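The conversion itself looks roughly like this (a sketch of that approach, assuming a fairseq wav2vec2 checkpoint at a placeholder path):

import torch
from fairseq import checkpoint_utils
from torchaudio.models.wav2vec2.utils import import_fairseq_model

# Placeholder path to the fairseq pre-trained checkpoint.
ckpt_path = "/path/to/checkpoint_best.pt"

# Load the fairseq model, convert it to torchaudio's Wav2Vec2Model,
# then script it so it can be served from C++ via libtorchaudio.
models, cfg, task = checkpoint_utils.load_model_ensemble_and_task([ckpt_path])
original = models[0].eval()
imported = import_fairseq_model(original)
scripted = torch.jit.script(imported)
scripted.save("wav2vec2_scripted.pt")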

Okay! So the TorchScript model can be used for an ASR product. Thanks a lot!

Hi, @KhusainovAidar

I found a way to convert a fairseq pre-trained model to the Transformers format. For your reference:

python -m transformers.models.wav2vec2.convert_wav2vec2_original_pytorch_checkpoint_to_pytorch --pytorch_dump_folder_path ./converted_model/ --checkpoint_path /path/to/**.pt --not_finetuned
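After conversion, the checkpoint can be loaded with the usual Transformers API, for example (the folder name matches the command above; some pre-training-only weights may be reported as unused):

from transformers import Wav2Vec2Model

# Load the converted (not fine-tuned) checkpoint as a base feature extractor.
model = Wav2Vec2Model.from_pretrained("./converted_model/")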


I’m also having this issue. Has anybody figured out what the cause is? I’m using the following hyperparameters:

python ./code/pretrain.py \
	--dataset_name="./code/VoicesOfColor" \
	--dataset_config_names="train" \
	--dataset_split_names="TEST" \
	--output_dir="./code/wav2vec2-pretrained-VOC/artefacts" \
	--model_name_or_path="patrickvonplaten/wav2vec2-base-v2" \
	--max_train_steps=600000 \
	--num_warmup_steps=32000 \
	--gradient_accumulation_steps=4 \
	--learning_rate=0.001 \
	--weight_decay=0.01 \
	--max_duration_in_seconds=30.0 \
	--min_duration_in_seconds=2.0 \
	--logging_steps=1 \
	--saving_steps=10000 \
	--per_device_train_batch_size=8 \
	--per_device_eval_batch_size=8 \
	--adam_beta1=0.9 \
	--adam_beta2=0.98 \
	--adam_epsilon=1e-06
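
For reference, a quick sanity check of what these flags imply (illustrative arithmetic only; the GPU count and a linear warmup schedule are assumptions):

# Values copied from the command above.
per_device_train_batch_size = 8
gradient_accumulation_steps = 4
num_gpus = 1  # assumption: adjust to the actual machine

effective_batch = per_device_train_batch_size * gradient_accumulation_steps * num_gpus
print(f"samples per optimizer update: {effective_batch}")  # 32 on a single GPU

# With linear warmup, the peak learning rate of 1e-3 is only reached after 32k updates.
num_warmup_steps = 32_000
learning_rate = 1e-3
print(f"learning rate at step 1000: {learning_rate * 1000 / num_warmup_steps:.2e}")  # ~3.1e-05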