Cuda out of memory issue training whisper model on single GPU

P15V · December 15, 2023, 7:58pm

Hello Hugging face community!!

I hope whoever reads this is having a great day!

So I’m working on the project of fine tuning a whisper model… Got it going with the small model without much issue, Got it working with medium on a smaller dataset
But training with this bigger dataset I have been working on, is not going well with the medium model. I’m trying to train with a batch size of 1, and gradient_accumulation_steps set way too high, even tried 32/64/96. Still met with the same out of memory issue. I’m trying to run this on 1, 2080TI with 12GB of Vram. So I’m wondering Anything I’m not thinking of to try? am I just going to have to leverage another tower with a second GPU(and integrate deep speed), or one with more VRAM?

Genuinely appreciate anyone time & input!! Thanks for reading & hope everyone has a great rest of the day!

Topic		Replies	Views
Whisperx : CUDA out of memory Models	0	1082	December 11, 2023
Fine-tune OPT 13B: CUDA out of memory error (720gb vram, batch size 1, fp16)! Beginners	6	4572	July 25, 2022
Loading extra memory in GPU 0 using DDP Intermediate	0	386	June 18, 2023
Fine Tuning Whisper Beginners	0	325	February 27, 2024
Help needed with issues while trying fine-tune Whisper Beginners	2	1406	April 19, 2024

Cuda out of memory issue training whisper model on single GPU

Related topics