How to quickly determine memory requirements for a model

Hey all - as I browse models for ones that suit my project, I'm trying to quickly determine the memory requirements for running each model locally. This seems like something many users would want, so I'd expect an obvious place on the model card to check, but I don't see one. I don't even see the parameter count stated consistently. How should I approach this?


Hi! You can find this info by checking the size of pytorch_model.bin (or tf_model.h5 / flax_model.msgpack for TensorFlow/Flax models). These files are sometimes sharded (you'll see a pytorch_model.bin.index.json in that case), and then you need to sum the sizes of all the shards listed in the index file.
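If you want to script it, here's a minimal sketch using huggingface_hub (the repo id and the file-extension filter are just examples); it asks the Hub for per-file sizes and sums every weight file, so the sharded case is handled automatically:

```python
# Minimal sketch: sum the sizes of a repo's weight files via the Hub API.
from huggingface_hub import HfApi

def weight_size_gb(repo_id: str) -> float:
    # files_metadata=True makes the Hub include per-file sizes in `siblings`
    info = HfApi().model_info(repo_id, files_metadata=True)
    weight_exts = (".bin", ".safetensors", ".h5", ".msgpack")
    total_bytes = sum(
        (f.size or 0)
        for f in info.siblings
        if f.rfilename.endswith(weight_exts)
    )
    # Note: repos that ship both .bin and .safetensors copies get double-counted,
    # so filter to a single format if that matters for your use case.
    return total_bytes / 1e9

print(f"{weight_size_gb('gpt2'):.2f} GB of weights on disk")
```

On-disk size is a decent proxy for the memory the raw weights need at load time, but actual usage will be somewhat higher once activations and framework overhead are added.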

PS: For the parameter count to be displayed, the weights must be saved in the safetensors format.
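For reference, converting a parameter count into memory is just parameters × bytes per parameter (a rough rule of thumb that ignores activations and other runtime overhead):

```python
# Back-of-the-envelope: bytes per parameter by dtype
# fp32 = 4, fp16/bf16 = 2, int8 = 1, int4 = 0.5
def params_to_gb(num_params: float, bytes_per_param: float) -> float:
    return num_params * bytes_per_param / 1e9

print(f"{params_to_gb(7e9, 2):.0f} GB")  # a 7B-parameter model in fp16/bf16: ~14 GB of weights
```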


Thanks very much for the help.

How would I determine these requirements when I'm fine-tuning a model? I'm fine-tuning a 780 MB model with a 5 MB dataset. On my local machine (16 GB of RAM) this runs fine, but when I use a GPU (Tesla T4 with 16 GB of GPU RAM), I immediately get an OOM error as soon as I launch trainer.train().

Have you tried Model Memory Calculator? Model Memory Utility - a Hugging Face Space by hf-accelerate
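As far as I understand, the Space essentially instantiates the model with empty ("meta") weights so nothing is actually allocated, then counts parameters and converts them to bytes per dtype. Something like this sketch (my understanding of the idea, not its exact code):

```python
# Build the model with empty weights (no real allocation), then count parameters.
from accelerate import init_empty_weights
from transformers import AutoConfig, AutoModelForCausalLM

config = AutoConfig.from_pretrained("gpt2")  # example model id
with init_empty_weights():
    model = AutoModelForCausalLM.from_config(config)

n_params = sum(p.numel() for p in model.parameters())
print(f"{n_params / 1e6:.0f}M parameters "
      f"-> ~{4 * n_params / 1e9:.2f} GB in fp32, ~{2 * n_params / 1e9:.2f} GB in fp16")
```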

Nice link, thanks.

So fine-tuning should consume less memory than training from scratch, and training the model I'm using should take only 4.57 GB according to the calculator. So I'm probably misunderstanding something about the model's architecture, since I have 16 GB of GPU RAM.
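For what it's worth, my rough understanding of where a training estimate like this comes from is weights + gradients + the two AdamW moments, all in fp32, before any activations:

```python
# Rough static training footprint with plain AdamW in fp32 (my assumption;
# activations, the CUDA context, and fragmentation come on top of this).
def train_static_gb(weights_gb: float) -> float:
    grads = weights_gb            # one fp32 gradient per parameter
    adam_states = 2 * weights_gb  # AdamW keeps two fp32 moments per parameter
    return weights_gb + grads + adam_states

print(f"{train_static_gb(0.78):.2f} GB")  # ~3.1 GB for a 780 MB fp32 model, before activations
```

I'm not sure exactly which assumptions produce the 4.57 GB figure, but either way the batch-dependent activation memory comes on top of it, which may be where the T4 runs out.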

The total memory required during training/tuning should depend on the batch size, right? Why isn't batch size an input to this utility?
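My (possibly wrong) mental model is that activation memory scales roughly linearly with batch size, e.g.:

```python
# Very rough activation estimate for a transformer (my assumption; `factor` stands
# in for however many (batch, seq_len, hidden) tensors each layer keeps for the
# backward pass, and it varies a lot by architecture and implementation).
def activation_gb(batch, seq_len, hidden, layers, bytes_per_el=4, factor=10):
    return batch * seq_len * hidden * layers * factor * bytes_per_el / 1e9

print(f"{activation_gb(8, 512, 1024, 24):.1f} GB")   # ~4 GB
print(f"{activation_gb(32, 512, 1024, 24):.1f} GB")  # ~16 GB: grows linearly with batch size
```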