RuntimeError: CUDA out of memory. Tried to allocate 1.91 GiB (GPU 0; 15.78 GiB total capacity; 12.36 GiB already allocated; 302.75 MiB free; 14.16 GiB reserved in total by PyTorch)

Hi,

I am trying to train a language model from scratch, but as soon as training starts I get this error:

RuntimeError: CUDA out of memory. Tried to allocate 1.91 GiB (GPU 0; 15.78 GiB total capacity; 12.36 GiB already allocated; 302.75 MiB free; 14.16 GiB reserved in total by PyTorch)

Can anyone help me?

I am so close to training my model, but this keeps happening.

Also, I have tried this code, but it does not fix the issue:

import gc

import torch

# Drop unreferenced Python objects, then release PyTorch's cached CUDA blocks
# back to the driver. Note this cannot free tensors that are still referenced.
gc.collect()
torch.cuda.empty_cache()

Maybe you can try lowering your batch size in TrainingArguments — the current argument is per_device_train_batch_size (per_gpu_train_batch_size is the older, deprecated name).
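For example, a minimal sketch, assuming you are using the Hugging Face transformers Trainer (the output_dir and the specific numbers here are placeholders for illustration) — shrink the per-device batch size and use gradient accumulation to keep the effective batch size the same:

```python
from transformers import TrainingArguments

# Effective batch size = per_device_train_batch_size * gradient_accumulation_steps
# (times the number of GPUs). Here 4 * 8 = 32, but only 4 samples are resident
# on the GPU at once, which is what drives peak activation memory.
training_args = TrainingArguments(
    output_dir="./results",          # hypothetical path
    per_device_train_batch_size=4,   # lower this until the OOM goes away
    gradient_accumulation_steps=8,   # raise this to compensate
)
```

Gradients are simply summed across the accumulation steps before the optimizer runs, so results should be close to the larger-batch run, at the cost of slower training.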

After reducing the batch size, consider the Adafactor optimizer instead of Adam — Adam keeps two extra state tensors per parameter, so the optimizer state alone can occupy roughly twice the model's memory. Also, if you can get fp16 training to converge for your chosen model, that would cut memory needs a lot.
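A hedged sketch of both suggestions, again assuming the transformers Trainer (the optim="adafactor" value exists in recent transformers versions; older releases used a separate adafactor=True flag instead, so check your version's docs):

```python
from transformers import TrainingArguments

training_args = TrainingArguments(
    output_dir="./results",          # hypothetical path
    per_device_train_batch_size=4,
    fp16=True,                       # half-precision training: large memory savings,
                                     # but verify the model still converges
    optim="adafactor",               # Adafactor stores far less optimizer state
)                                    # than Adam's two moment tensors per parameter
```

If fp16 diverges (NaN losses are the usual symptom), try keeping Adafactor but dropping fp16 first, since the two changes are independent.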