Wav2vec2 xlsr nan train loss

tadf · June 11, 2021, 5:43am

Hi,
I’m running into nan training_loss when training wav2vec2 xlsr with my custom dataset.
Weird thing is that even though training_loss goes to nan, eval_loss still goes down, and error_rate (cer and wer) also goes down.
I’ve experimented with lower learning_rate, but still getting similar behavior. I’m logging with wandb.

My graphs look like the following:

There’s no value for train/loss after ~60 steps since it is nan, but eval/loss is still decreasing.

Has anyone experienced similar behavior?

tadf · June 14, 2021, 4:22am

I’ve let it train over the weekend, still NAN train loss, but eval loss and both WER and CER continue to decrease

Topic		Replies	Views
`nan` training loss but eval loss does improve over time Research	5	4006	October 10, 2022
Wav2Vec2: How to correct for nan in training and validation loss Models	13	9815	October 22, 2023
Fine-tuning Wav2v2.0: Loss increasing, WER decreasing Models	2	576	June 30, 2023
Training and evaluation loss goes down however, WER score stays the same 🤗Transformers	0	368	May 23, 2022
Finetuning Wav2Vec2 loss constant Beginners	1	301	August 14, 2023

Wav2vec2 xlsr nan train loss

Related topics