I am looking for a calculator tool that can estimate, in advance, how much memory a GPU instance will use. Specifically, I'm interested in the following configuration:

- FSDP ZeRO-2
- Model: e.g., Llama 3 8B
- LoRA
- Max tokens: 8K
- 16-bit precision
- Gradient checkpointing
- No quantization
Any guidance or recommendations would be greatly appreciated! Thank you.
Edit: I know about the HF tool ("Understanding how big of a model can fit on your machine"), but it is rudimentary and lacks many parameters such as max sequence length, LoRA, …
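Until a proper calculator covers these knobs, a rough back-of-envelope estimate for this setup can be sketched by hand. The sketch below assumes frozen 16-bit base weights (replicated per GPU, since ZeRO-2 shards only gradients and optimizer state), Adam optimizer state in fp32 for the trainable LoRA parameters only, and roughly one saved activation per layer boundary under gradient checkpointing. The `lora_params` count is a hypothetical placeholder (it depends on rank and target modules), and the activation term ignores attention buffers and temporary workspace, so treat the total as a lower bound:

```python
def estimate_vram_gb(
    n_params=8e9,            # assumed dense parameter count for Llama 3 8B
    hidden=4096, n_layers=32,  # Llama 3 8B architecture
    seq_len=8192, batch=1,     # 8K max tokens
    lora_params=42e6,        # hypothetical adapter size; depends on rank/targets
    num_gpus=1,
):
    """Back-of-envelope per-GPU VRAM: 16-bit frozen base weights + LoRA
    adapters + ZeRO-2-sharded gradients/optimizer state + checkpointed
    activations. Ignores attention/workspace buffers, so it underestimates."""
    GB = 1024 ** 3
    weights = n_params * 2 / GB               # frozen bf16 base, replicated
    adapters = lora_params * 2 / GB           # trainable LoRA weights, bf16
    grads = lora_params * 2 / num_gpus / GB   # bf16 grads, sharded by ZeRO-2
    optim = lora_params * 8 / num_gpus / GB   # Adam m+v in fp32, sharded
    # gradient checkpointing: keep ~one activation tensor per layer boundary
    acts = batch * seq_len * hidden * n_layers * 2 / GB
    return {
        "weights_gb": weights, "adapters_gb": adapters,
        "grads_gb": grads, "optim_gb": optim, "acts_gb": acts,
        "total_gb": weights + adapters + grads + optim + acts,
    }

print(estimate_vram_gb())
```

With these assumptions the base weights alone come to about 14.9 GiB per GPU, which is why quantization-free LoRA on an 8B model still needs a fairly large card even though the trainable state is tiny.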