Fine-tuning Wav2Vec2 for English ASR with 🤗 Transformers on a local machine

I’m running the https://huggingface.co/blog/fine-tune-wav2vec2-english#training–evaluation example on my local machine and getting `Training Loss = nan`:

| Step | Training Loss | Validation Loss | Wer      | Runtime    | Samples Per Second |
|-----:|--------------:|----------------:|---------:|-----------:|-------------------:|
| 200  | nan           | 13.842948       | 2.703102 | 204.199500 | 8.227000           |
| 400  | nan           | 13.842948       | 2.703102 | 204.301000 | 8.223000           |
| 600  | nan           | 13.842948       | 2.703102 | 204.371700 | 8.220000           |
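One way to pin down where the loss first turned `nan` is to scan the trainer's log history. Below is a minimal sketch (not from the blog post) that assumes entries shaped like those in `trainer.state.log_history`, i.e. dicts carrying `"step"` and `"loss"` keys:

```python
import math

def first_nan_step(log_history):
    """Return the first step whose logged training loss is NaN, or None.

    `log_history` mimics `trainer.state.log_history`: a list of dicts.
    Only the "step" and "loss" keys are assumed here; real entries
    carry more fields (learning rate, epoch, ...).
    """
    for entry in log_history:
        loss = entry.get("loss")
        if loss is not None and math.isnan(loss):
            return entry["step"]
    return None

# Example: the loss is finite at step 100 and NaN from step 200 on.
logs = [
    {"step": 100, "loss": 5.2},
    {"step": 200, "loss": float("nan")},
]
print(first_nan_step(logs))  # 200
```

Running this right after `trainer.train()` tells you whether the loss was `nan` from the very first logging step (suggesting a setup problem such as fp16 overflow) or diverged later.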

Local modifications to the example

training_args = TrainingArguments(
    output_dir="./wav2vec2-base-timit-demo",
    group_by_length=True,
    per_device_train_batch_size=4,  # changed from 32
    …
    save_steps=200,
    eval_steps=200,
    logging_steps=100,
    …
)

Update: after moving the job to the CPU, the training loss takes real values.
Steps so far:

  1. Make CUDA unavailable:

     import torch

     torch.cuda.is_available = lambda: False

  2. Disable mixed precision:

     training_args = TrainingArguments(
         …
         # fp16=True,
         …
     )
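Instead of monkey-patching `torch.cuda.is_available`, the same effect can be had through `TrainingArguments` itself. A sketch, assuming a transformers version that still accepts the `no_cuda` flag (newer releases rename it to `use_cpu`):

```python
from transformers import TrainingArguments

training_args = TrainingArguments(
    output_dir="./wav2vec2-base-timit-demo",
    no_cuda=True,   # run the Trainer on CPU without patching torch.cuda
    # fp16=True,    # must stay off: mixed precision requires CUDA
)
```

This keeps the CPU/GPU choice visible in the training configuration rather than hidden in a runtime patch.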

*Mixed precision training with AMP or APEX (--fp16) and FP16 evaluation can only be used on CUDA devices.*