Why don't I have access to all the GPU's VRAM?

cumprod · March 10, 2023, 3:05am

I run out of memory for my space very often. When I do, it shows something like this:
RuntimeError: CUDA out of memory. Tried to allocate 512.00 MiB (GPU 0; 7.93 GiB total capacity; 4.04 GiB already allocated; 470.94 MiB free; 4.16 GiB reserved in total by PyTorch)
My question is…why do I have only 4GB of VRAM to play with?
Why is the rest of the memory reserved?
I have access to all 12GB of T4 VRAM on Google Colab and never run out of memory there.
What am I doing wrong?
Am I being stupid?

Thank you and God Bless

radames · March 10, 2023, 6:53pm

hi @cumprod were you using a T4 - small instance? can you share your Space link?

cumprod · March 11, 2023, 1:22am

Yep I’m using a T4 small instance.
https://huggingface.co/spaces/cumprod/xbox

Thanks for looking

Topic		Replies	Views
Uploading a space on paid GPU's Spaces	4	21	June 6, 2025
CUDA out of memory on Nvidia A10G + Codellama on HuggingFace Spaces Beginners	6	499	February 8, 2024
Can't load huge model onto multiple GPU's Beginners	5	5197	June 15, 2023
CUDA memory suddenly run out of space when only used a quarter of memory Models	0	1132	January 7, 2023
[Diffusers] PyTorch running out of memory 🧨 Diffusers	1	774	August 30, 2022

Why don't I have access to all the GPU's VRAM?

Related topics