GPU memory usage is twice (2x) what I calculated based on number of parameters and floating point precision

Also note that the CUDA driver reserves some memory on the GPU when its context is initialized, so nvidia-smi overstates what your tensors actually use. It's better to use torch.cuda.memory_allocated() here, which reports only PyTorch's tensor allocations.

E.g. just allocating a tiny tensor on the GPU will show 152MiB in nvidia-smi:

import time

import torch

# Moving even a tiny tensor to the GPU forces CUDA context creation,
# which accounts for the large baseline shown in nvidia-smi.
t = torch.tensor([0., 1.]).cuda()

# Keep the process alive so you can inspect it with nvidia-smi.
time.sleep(10)
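To separate your tensors' footprint from the context overhead, a minimal sketch (assuming a CUDA-capable machine; the exact allocated figure depends on PyTorch's caching allocator, which rounds small allocations up):

```python
import torch

def tensor_bytes(t: torch.Tensor) -> int:
    # Raw storage the tensor's elements need, ignoring allocator rounding.
    return t.nelement() * t.element_size()

t = torch.tensor([0., 1.])          # float32 -> 4 bytes per element
print(tensor_bytes(t))              # 8 bytes of actual data

if torch.cuda.is_available():
    t = t.cuda()
    # Counts only tensor allocations made by PyTorch, not the CUDA
    # context overhead that nvidia-smi includes (often 100+ MiB).
    print(torch.cuda.memory_allocated())
```

Comparing tensor_bytes() against nvidia-smi's total is what makes the numbers look inflated; memory_allocated() is the figure to check against your parameter-count math.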