I am trying to fine-tune the facebook/wav2vec2-base
model, but I keep running into CUDA out-of-memory errors after a few epochs:
File "/home/sfalk/miniconda3/envs/speech/lib/python3.8/site-packages/transformers/models/wav2vec2/modeling_wav2vec2.py", line 631, in forward
hidden_states, attn_weights, _ = self.attention(
File "/home/sfalk/miniconda3/envs/speech/lib/python3.8/site-packages/torch/nn/modules/module.py", line 1102, in _call_impl
return forward_call(*input, **kwargs)
File "/home/sfalk/miniconda3/envs/speech/lib/python3.8/site-packages/transformers/models/wav2vec2/modeling_wav2vec2.py", line 553, in forward
attn_weights = nn.functional.softmax(attn_weights, dim=-1)
File "/home/sfalk/miniconda3/envs/speech/lib/python3.8/site-packages/torch/nn/functional.py", line 1680, in softmax
ret = input.softmax(dim)
RuntimeError: CUDA out of memory. Tried to allocate 4.24 GiB (GPU 0; 10.92 GiB total capacity; 5.63 GiB already allocated; 3.44 GiB free; 6.67 GiB reserved in total by PyTorch) If reserved memory is >> allocated memory try setting max_split_size_mb to avoid fragmentation. See documentation for Memory Management and PYTORCH_CUDA_ALLOC_CONF
I am running this training on 4x GeForce GTX 1080 Ti GPUs (11 GB VRAM each), using only a batch size of 1 and no gradient accumulation whatsoever.
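For reference, this is roughly how training is set up (a simplified sketch: `train_dataset` and `data_collator` stand in for my actual data pipeline, and the output path is a placeholder):

```python
from transformers import Trainer, TrainingArguments, Wav2Vec2ForCTC

# Simplified sketch of my setup; train_dataset and data_collator are
# placeholders for my actual (omitted) data loading and preprocessing.
model = Wav2Vec2ForCTC.from_pretrained("facebook/wav2vec2-base")

training_args = TrainingArguments(
    output_dir="./wav2vec2-base-finetuned",  # placeholder path
    per_device_train_batch_size=1,           # batch size of 1
    gradient_accumulation_steps=1,           # no gradient accumulation
    num_train_epochs=10,
)

trainer = Trainer(
    model=model,
    args=training_args,
    train_dataset=train_dataset,  # placeholder
    data_collator=data_collator,  # placeholder
)
trainer.train()
```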
Can I do anything about this? Should I use a smaller model to begin with? If so, what options do I have?
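For what it's worth, the knobs I am aware of (but have not verified) are gradient checkpointing, freezing the convolutional feature encoder, and filtering out very long clips, since the attention matrices where the OOM occurs grow quadratically with input length. A sketch of what I mean:

```python
from transformers import Wav2Vec2ForCTC

model = Wav2Vec2ForCTC.from_pretrained("facebook/wav2vec2-base")

# Recompute activations during the backward pass instead of storing them
# (trades compute for memory).
model.gradient_checkpointing_enable()

# Don't backprop through the CNN feature encoder; on older transformers
# versions this method is called freeze_feature_extractor().
model.freeze_feature_encoder()

# Assumption: drop clips longer than ~20 s at 16 kHz; train_dataset and the
# "audio" column layout are placeholders for my actual datasets.Dataset.
MAX_SAMPLES = 20 * 16_000
train_dataset = train_dataset.filter(
    lambda ex: len(ex["audio"]["array"]) <= MAX_SAMPLES
)
```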