Idle GPU when finetuning whisper tiny

bozden · May 29, 2023, 6:12pm

I had the same problem and after some experimentation, I found out it may be because the data is not fed enough to GPU and it starves, probably because the tiny model does not have many parameters and finishes the calculations quickly (RTX-3090 here).

Although it did not fill the GPU fully, setting the dataloader_num_workers to virtual core count (I have a 6/12 CPU, so I set it to 12) helped a lot. I had many other changes like sharding etc, but I think this is the main parameter that helped.

Topic		Replies	Views
Whisper medium finetuning RTX 4090 mostly stays idle Beginners	5	292	December 7, 2024
Cuda out of memory issue training whisper model on single GPU Intermediate	0	932	December 15, 2023
Help needed with issues while trying fine-tune Whisper Beginners	2	1426	April 19, 2024
How to load common voice dataset locally and fine tune whisper with that Beginners	0	218	April 12, 2024
[Open-to-the-community] Whisper fine-tuning event Community Calls	31	12129	December 10, 2023

Idle GPU when finetuning whisper tiny

Related topics