How much memory is needed to fine-tune wav2vec2?

I’m trying to replicate this blog post on fine-tuning XLSR (Fine-Tune XLSR-Wav2Vec2 for low-resource ASR with 🤗 Transformers) and I’m running into CUDA out-of-memory issues. I’m training on a machine with multiple NVIDIA Titan V GPUs (12 GB memory each), and even when I:

  1. reduce the batch size to 1
  2. remove all clips longer than 5 seconds (I even reduced this to 2 seconds)
  3. use Adafactor instead of AdamW (as suggested here: Performance and Scalability: How To Fit a Bigger Model and Train It Faster)

I still run out of memory. I’m not sure whether this suggests there is a bug in my code somewhere or I simply don’t have enough memory to do this — wait, no em-dash — so any advice would be appreciated!
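For reference, a minimal sketch of the clip-length filtering step (item 2 above). The 16 kHz sampling rate matches the XLSR blog post's Common Voice setup; the function name and the 5-second cutoff here are illustrative assumptions:

```python
# Sketch: drop clips longer than a cutoff before training.
# 16 kHz is the sampling rate used in the XLSR blog post;
# the 5-second cutoff mirrors the filtering described above.
SAMPLING_RATE = 16_000
MAX_SECONDS = 5.0

def is_short_enough(audio_array, sampling_rate=SAMPLING_RATE, max_seconds=MAX_SECONDS):
    """Return True if the clip's duration is within the cutoff."""
    return len(audio_array) / sampling_rate <= max_seconds

# With a 🤗 datasets Dataset this could be applied as, e.g.:
#   dataset = dataset.filter(lambda ex: is_short_enough(ex["audio"]["array"]))
clip = [0.0] * (3 * SAMPLING_RATE)   # a fake 3-second clip
print(is_short_enough(clip))         # → True
```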

Is it able to start training before running into CUDA issues? Have you tried model sharding, since you only have access to 12 GB GPUs? You could also try cloud resources; they offer 15 GB and 32 GB GPUs.


I don’t think it gets to the stage where it reports x/y epochs, etc., but the cell with trainer.train() does at least some computation before running out of memory.

I haven’t tried model sharding, thanks for suggesting that - I’ll look into it!
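For anyone landing here later: a rough sketch of the model-sharding idea in plain PyTorch, splitting a network across two devices and moving activations between them. The toy two-layer model is illustrative, not the actual Wav2Vec2 architecture, and the sketch falls back to CPU so it runs anywhere; with two 12 GB Titan Vs the devices would be "cuda:0" and "cuda:1":

```python
import torch
import torch.nn as nn

# Toy sketch of manual model sharding: the first half of the network lives
# on one device, the second half on another, and activations hop between
# them in forward(). Falls back to CPU when two GPUs are not available.
dev0 = torch.device("cuda:0" if torch.cuda.device_count() >= 2 else "cpu")
dev1 = torch.device("cuda:1" if torch.cuda.device_count() >= 2 else "cpu")

class ShardedNet(nn.Module):
    def __init__(self):
        super().__init__()
        self.part1 = nn.Linear(16, 32).to(dev0)  # would sit on GPU 0
        self.part2 = nn.Linear(32, 8).to(dev1)   # would sit on GPU 1

    def forward(self, x):
        x = self.part1(x.to(dev0))
        x = self.part2(x.to(dev1))  # activations move to the second device
        return x

out = ShardedNet()(torch.randn(4, 16))
print(out.shape)  # torch.Size([4, 8])
```

Each shard then only needs to hold its own parameters, gradients, and optimizer state, which is the memory saving the suggestion above is after.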

If that doesn’t work, maybe I’ll look into cloud options.
