I tried to fine-tune without freezing, but a 16GB V100 ran out of memory even with batch size 1 and max source length 512.
I am trying to run the mBART En-Ro example and I get a CUDA OOM even with max_len 64, n_train 5000, and bs 1 (on Google Colab with a 15GB P8 GPU).
Anyone managed to run it with under 16 GB?
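For what it's worth, freezing the embedding tables is one of the cheaper memory wins for mBART, since its ~250k-token vocabulary makes the embeddings a large share of the parameters, and frozen parameters need no gradients or Adam moment buffers. Here is a rough PyTorch sketch of the idea using a toy module (the sizes and module layout are made up for illustration; the real model would come from `MBartForConditionalGeneration.from_pretrained(...)`):

```python
import torch
from torch import nn

# Toy stand-in for mBART: an embedding table plus a trainable layer
# (hypothetical sizes, just to show the mechanics of freezing).
model = nn.Sequential(
    nn.Embedding(1000, 64),   # stands in for the shared embedding table
    nn.Linear(64, 64),        # stands in for encoder/decoder weights
)

# Freeze the embedding: no gradients are stored for it, and the
# optimizer allocates no Adam moment buffers for it either.
for p in model[0].parameters():
    p.requires_grad = False

# Only pass trainable parameters to the optimizer.
optimizer = torch.optim.Adam(
    (p for p in model.parameters() if p.requires_grad), lr=3e-5
)

trainable = sum(p.numel() for p in model.parameters() if p.requires_grad)
total = sum(p.numel() for p in model.parameters())
print(f"trainable params: {trainable} / {total}")
```

In the real model the savings are much larger than in this toy, because the frozen embeddings would otherwise each carry a gradient plus two Adam state tensors of the same size.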