CUDA out of memory for Longformer

shainaraza · October 11, 2020, 4:03pm

I have issue training the longformer on custom dataset, even on a small batch number, it says CUDA out of memory,

RuntimeError Traceback (most recent call last)
in ()
----> 1 trainer.train()

18 frames
/usr/local/lib/python3.6/dist-packages/torch/nn/functional.py in _pad(input, pad, mode, value)
3550 assert len(pad) // 2 <= input.dim(), ‘Padding length too large’
3551 if mode == ‘constant’:
-> 3552 return _VF.constant_pad_nd(input, pad, value)
3553 else:
3554 assert value == 0, ‘Padding mode “{}”" doesn’t take in value argument’.format(mode)

RuntimeError: CUDA out of memory. Tried to allocate 1.13 GiB (GPU 0; 15.90 GiB total capacity; 11.40 GiB already allocated; 659.81 MiB free; 14.39 GiB reserved in total by PyTorch)

Emanuel · October 20, 2021, 1:32pm

Did you try smaller batch sizes? What is the size of single batch size in your RAM?

shainaraza · October 20, 2021, 1:44pm

I tried 8 batch size, do not remember the single batch size. I use colab so it has its own limitations even the Pro version. any thoughts?

Emanuel · October 20, 2021, 2:50pm

Your input sentences are being limited to a maximum size?

shainaraza · October 22, 2021, 11:26am

I truncated to lesser as well.

Emanuel · October 22, 2021, 12:00pm

Could you provide a simple snippet to reproduce the OOM?

shainaraza · October 22, 2021, 1:06pm

if you see my post above, its months ago, its not actively something I am working on, I didn’t get answer but I solved that time by having smaller batches and less input.
I have few more issues like batch processing, having predictions at the end with a score, do you mind discussing that?

Topic		Replies	Views
Always getting RuntimeError: CUDA out of memory with Trainer 🤗Transformers	10	6908	April 4, 2024
CUDA out of memory on multi-GPU 🤗Transformers	1	2649	March 6, 2024
RuntimeError: CUDA out of memory. Tried to allocate 11.53 GiB (GPU 0; 15.90 GiB total capacity; 4.81 GiB already allocated; 8.36 GiB free; 6.67 GiB reserved in total by PyTorch) Beginners	4	3067	April 20, 2021
RuntimeError: CUDA out of memory even with simple inference Beginners	1	5372	January 16, 2022
torch.cuda.OutOfMemoryError: CUDA out of memory. Tried to allocate 256.00 MiB (GPU 0; 39.56 GiB total capacity; 37.84 GiB already allocated; 242.56 MiB free; 37.96 GiB reserved in total by PyTorch) 🤗Transformers	2	5347	June 7, 2023

CUDA out of memory for Longformer

Related topics