CUDA out of memory

Hi

I'm fine-tuning xlm-roberta-large following this tutorial, and during training on Colab I hit a CUDA out of memory error:

RuntimeError: CUDA out of memory. Tried to allocate 978.00 MiB (GPU 0; 14.76 GiB total capacity; 12.62 GiB already allocated; 919.75 MiB free; 12.83 GiB reserved in total by PyTorch

And this is with batch_size = 1.

I tried xlm-roberta-base instead; training lasts longer but eventually ends with the same error. The bert-base-uncased model from the tutorial works fine, but my data is multilingual!

I want to understand whether this is simply a hard limit of Colab or something I'm doing wrong. Is it possible to fine-tune xlm-roberta-large in Colab?

Thanks!

Hi @Constantin, it’s possible that you’ve been allocated one of the K80 GPUs on Colab, which probably doesn’t have enough memory to handle xlm-roberta-large.

You can “cheat” your way to a better GPU (either a Tesla T4 or P100) by selecting Runtime > Factory reset runtime in the settings.

You can check what kind of GPU your notebook is running by executing the following in a code cell:

!nvidia-smi
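
Or, if you prefer to check from Python, here's a minimal sketch assuming torch is already installed in the Colab runtime:

import torch

# Report which GPU was allocated and how much memory it has
if torch.cuda.is_available():
    props = torch.cuda.get_device_properties(0)
    print(f"GPU: {props.name}, total memory: {props.total_memory / 1024**3:.1f} GiB")
else:
    print("No GPU allocated to this runtime")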

You could also try Kaggle. It’s very similar and gives you a P100 (I think they give you more memory as well).

Tried using Kaggle’s P100, but I am getting the same error, i.e.

OutOfMemoryError: CUDA out of memory. Tried to allocate 384.00 MiB (GPU 0; 15.90 GiB total capacity; 14.70 GiB already allocated; 245.75 MiB free; 14.78 GiB reserved in total by PyTorch) If reserved memory is >> allocated memory try setting max_split_size_mb to avoid fragmentation.  See documentation for Memory Management and PYTORCH_CUDA_ALLOC_CONF
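
For reference, here is a sketch of how the fragmentation workaround mentioned in the error could be applied. Note that in this trace reserved (14.78 GiB) and allocated (14.70 GiB) memory are nearly equal, so fragmentation may not be the real problem; the split size of 128 MiB below is just an arbitrary example, not a value from this thread. The environment variable has to be set before CUDA is initialized:

import os

# Must be set before CUDA is initialized (ideally before importing torch)
os.environ["PYTORCH_CUDA_ALLOC_CONF"] = "max_split_size_mb:128"

import torch  # the allocator picks up the config on first CUDA use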

Any advice on using another model for the sequence classification task rather than XLM?