I am working through the Fine-Tune XLSR-Wav2Vec2 on Turkish ASR with Transformers blog post. Everything works fine on Colab, but when I run the same code and data on an Apple M2 Max I get a WER of 1.0, with the model predicting empty strings. Resampling with `Audio` to 16000 Hz also produces different values for the elements of the audio arrays on Colab vs. the M2 Max. This is very frustrating, since it means I cannot use the M2 Max for local training runs. Any idea how this can be fixed?
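To pin down where the arrays start to diverge, one option is to fingerprint the resampled audio on both machines and diff the results. Below is a minimal sketch: `fingerprint` is a hypothetical helper (not from the blog post), and it assumes the `datasets` `Audio` column exposes the samples as `batch["audio"]["array"]`; the rounding step ignores harmless float noise so only real discrepancies show up.

```python
import hashlib

import numpy as np

def fingerprint(arr) -> str:
    """Stable fingerprint of an audio array: round to 6 decimal places so
    tiny float noise is ignored, then hash the raw bytes."""
    rounded = np.round(np.asarray(arr, dtype=np.float64), 6)
    return hashlib.sha256(rounded.tobytes()).hexdigest()[:16]

# Demo on a synthetic 440 Hz tone; on the real data you would print
# fingerprint(batch["audio"]["array"]) for the same sample on Colab and
# on the M2 Max and compare the two strings.
t = np.linspace(0, 1, 16_000, endpoint=False)
tone = np.sin(2 * np.pi * 440 * t).astype(np.float32)
print(fingerprint(tone))
```

If the fingerprints already differ right after the `Audio` cast, the problem is in decoding/resampling rather than in training itself.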
When I train on the CPU only (`use_cpu=True` in `trainings_arg`), the WER matches the Colab result, but training is much slower. So it seems to be related to PyTorch and Apple's MPS/GPU backend. Any idea how to get MPS to produce a correct training run?
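One thing worth trying before giving up on the GPU: PyTorch has a real environment variable, `PYTORCH_ENABLE_MPS_FALLBACK`, that makes unimplemented MPS operations silently run on the CPU instead. A minimal sketch (the `TrainingArguments` call in the comment is illustrative, not taken from the blog post):

```python
import os

# Must be set BEFORE `import torch`: tells PyTorch to fall back to the
# CPU for any op the MPS backend does not implement.
os.environ["PYTORCH_ENABLE_MPS_FALLBACK"] = "1"

# If MPS still produces a degenerate WER, forcing the CPU is the known
# workaround, e.g. (hypothetical arguments, adapt to your setup):
#
#   from transformers import TrainingArguments
#   trainings_arg = TrainingArguments(output_dir="./wav2vec2-tr",
#                                     use_cpu=True)

print(os.environ["PYTORCH_ENABLE_MPS_FALLBACK"])
```

Note the fallback only helps if the bad numbers come from a broken or missing MPS kernel; if the arrays already differ at the resampling stage, the environment variable will not change that.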