I have a deployment in my Space, but the GPU is not being used. I installed CUDA 11.8 and I am using torch.
Is it a ZeroGPU Space?
If so, you’ll need to follow a slightly different procedure.
If not, that’s strange. Maybe you forgot to move the model or pipeline with .to("cuda").
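A quick sanity check worth running first, before suspecting the model code: confirm that torch can see the GPU at all. This is a minimal sketch; the commented-out model lines use hypothetical names just to illustrate the pattern.

```python
import torch

# If this prints False, .to("cuda") can never work: you likely have a
# CPU-only torch build or a driver/runtime mismatch.
print(torch.__version__)
print(torch.cuda.is_available())
print(torch.version.cuda)  # CUDA version torch was built against; None on CPU-only builds

# Hypothetical model/pipeline names, shown only to illustrate the pattern:
# model = AutoModel.from_pretrained("some/model").to("cuda")
# inputs = {k: v.to("cuda") for k, v in inputs.items()}  # inputs must move too, not just the model
```

Note that the inputs have to be moved to the same device as the model; forgetting that is a common reason the GPU appears unused.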
Hi John,
Thank you for the quick response.
When I send a request to the inference endpoint, I can see the CPU being used but the GPU never gets picked up, even though I have included .to("cuda"). I am using the 1x NVIDIA L4. Just a snapshot below: when the request is sent, CPU usage touches 44% but the GPU stays idle.
It’s getting weirder and weirder…
The only unusual thing is that CUDA 11.8 is much older than the 12.4 that is common on HF, but torch should still work with an older version…
And if there isn’t enough VRAM, it should offload properly as long as accelerate is installed.
I think trying to load some unrelated model into the GPU might help isolate the problem.
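To isolate it that way, a tiny unrelated workload is enough: if this runs on the GPU, the hardware/driver side is fine and the problem is in the app code. A minimal sketch (it falls back to CPU so it still runs anywhere):

```python
import torch

# Smoke test with an unrelated workload, independent of your model.
device = "cuda" if torch.cuda.is_available() else "cpu"
x = torch.randn(512, 512, device=device)
y = x @ x  # any real compute will show up in GPU utilization
if device == "cuda":
    torch.cuda.synchronize()  # make sure the kernel actually ran before checking
print(device, y.shape)
```

If `device` comes out as `cpu` here on an L4 Space, the torch build itself can’t see the GPU, which would explain the symptoms regardless of `.to("cuda")`.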