Why does all my GPU memory get used with a small model?

I’m using Colab Pro to try out a few Hugging Face models, but even loading a 2 GB model completely fills up the GPU memory (a 16 GB Tesla), and then as the data loads I frequently run out of memory. Can anyone explain why a small model would max out GPU memory?

My dataset is about 1 GB between train and test, if that makes a difference.


I can’t say for sure without seeing how you are running it. Can you troubleshoot and see what’s running on your GPU?

Thanks! Do you know how I can see what processes are running in the terminal?

Can you share the parameters you are training with, i.e. batch size, epochs, etc.? I have run into memory issues with a 1 GB model; 2 GB is actually pretty big when training.
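
If the batch size turns out to be the culprit, and assuming you are training with the Hugging Face Trainer, something along these lines can cut peak memory (just a sketch; the output path and numbers are placeholders):

from transformers import TrainingArguments

# A smaller per-device batch plus gradient accumulation keeps the same
# effective batch size while lowering peak activation memory; fp16
# mixed precision roughly halves activation memory again.
training_args = TrainingArguments(
    output_dir="out",                # placeholder path
    num_train_epochs=3,              # placeholder value
    per_device_train_batch_size=4,   # try lowering this first
    gradient_accumulation_steps=8,   # effective batch = 4 * 8 = 32
    fp16=True,                       # mixed precision
)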

To check GPU resources in Google Colab you can try something like this:

# On the left side you can open a Terminal ('>_' icon with black background).
# You can run commands from there even while a cell is running.
# This command shows GPU usage in real time:
$ watch nvidia-smi
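
You can also check from inside the notebook: the ! shell escape works in any Colab cell, and if you happen to be on PyTorch, its CUDA helpers report what the allocator is holding (a sketch, assuming PyTorch):

# Run nvidia-smi once from a notebook cell:
!nvidia-smi

# If you are using PyTorch, query the caching allocator directly:
import torch
print(torch.cuda.memory_allocated() / 1e9, "GB held by live tensors")
print(torch.cuda.memory_reserved() / 1e9, "GB reserved by the allocator")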

Thanks! I will try that out.

Are you using TensorFlow? By default it allocates all the available GPU memory. There are ways around that, though.
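
For example, enabling memory growth makes TensorFlow allocate GPU memory on demand instead of grabbing the whole card at startup. A minimal sketch; it has to run before anything initializes the GPU:

import tensorflow as tf

# Allocate GPU memory on demand rather than reserving it all up front.
# This must run before the GPU is first used in the session.
for gpu in tf.config.list_physical_devices("GPU"):
    tf.config.experimental.set_memory_growth(gpu, True)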