Wav2vec2-xls-r-2b out of memory issues on A100 (40 GB)

ssrockonindia · May 13, 2022, 1:25am

I am trying to finetune wav2vec2-xls-r-2b model on some common voice dataset but it is giving me a memory error. I even tried lowering the batch size (2 or 4) but it gives me the same error. I have 8 A100 GPUs, even if I specify 3 or 4 of them to use it gives me the same error. Also, I fine-tuned xlsr-53, xls-r-300M and xls-r-1B models with batch size 64 on the same dataset, it worked without any out-of-memory issues. Here is the error:

RuntimeError: CUDA out of memory. Tried to allocate 20.00 MiB (GPU 0; 39.59 GiB total capacity; 35.68 GiB already allocated; 6.19 MiB free; 37.51 GiB reserved in total by PyTorch) If reserved memory is >> allocated memory try setting max_split_size_mb to avoid fragmentation.  See documentation for Memory Management and PYTORCH_CUDA_ALLOC_CONF

How to solve these memory issues? Help would be really appreciated.

Topic		Replies	Views
Wav2vec2.0 memory issue Models	13	11499	December 25, 2024
How much memory to fine tune wav2vec2? Models	2	1144	March 7, 2022
Constantly running out of memory fine-tuning Wav2Vec2 DeepSpeed	1	975	April 28, 2022
Wav2vec2 not releasing memory after batch Models	1	469	May 22, 2023
Multi GPU Audio Finetuning for Wav2vec2 Failing for 4 GPUs but successful for 1 GPU Beginners	0	307	July 9, 2023

Wav2vec2-xls-r-2b out of memory issues on A100 (40 GB)

Related topics