GPU memory usage is twice (2x) what I calculated based on number of parameters and floating point precision

Nope, that’s not what I’m saying at all. There’s a certain amount of overhead CUDA needs for what it does under the hood with its drivers (the CUDA context). It’s far from 2x, otherwise it’d be impossible to train some models :slight_smile: (And the rest is all usable memory that’s available, it just might not be “in use”.)
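To make the gap concrete, here’s a minimal sketch of the back-of-the-envelope math for the weights alone versus what `nvidia-smi` reports. The 7B-parameter/fp16 figures are hypothetical numbers for illustration, not from this thread:

```python
# Rough estimate of raw weight memory vs. what nvidia-smi reports.
# Hypothetical example: a 7B-parameter model in fp16.

BYTES_PER_PARAM = {"fp32": 4, "fp16": 2, "bf16": 2, "int8": 1}

def weight_memory_gib(n_params: int, dtype: str) -> float:
    """Memory needed for the raw weights alone, in GiB."""
    return n_params * BYTES_PER_PARAM[dtype] / 1024**3

print(f"weights alone: {weight_memory_gib(7_000_000_000, 'fp16'):.1f} GiB")
# nvidia-smi will show more than this: it also counts the CUDA context
# (typically a few hundred MiB per process), the framework's caching
# allocator reserve, and at training time activations, gradients, and
# optimizer states -- but not a flat 2x of the weights themselves.
```

The extra terms scale differently (the context is roughly fixed per process, while activations depend on batch size and sequence length), which is why there’s no single multiplier you can apply to the parameter count.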

This is more indicative, but in general, if you get a CUDA OOM, that just means exactly that: you ran out of CUDA memory. Staring at either number for hints won’t, per se, do much.

After you’ve gotten through the initial parts (so a step or two into training), you can eyeball the output of nvidia-smi (or the GPU memory allocated % when looking at something like W&B).
