Issue with CUDA availability on an A10 GPU Space instance

We have been using Hugging Face Spaces to host a model demo on an A10 GPU instance. Until recently everything worked as expected and our demo ran smoothly.

However, our demo has suddenly stopped working. Upon investigation, we found that we can no longer use CUDA with PyTorch on the instance: torch.cuda.is_available() returns False, and the accompanying warning suggests the issue may be an outdated CUDA driver.
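
For reference, a minimal diagnostic along these lines can confirm whether the installed driver is older than the CUDA runtime PyTorch was built against (nvidia-smi being present on the instance is an assumption):

import subprocess

import torch

# Report the CUDA runtime PyTorch was built against and whether it
# can actually reach a GPU.
print("torch:", torch.__version__)
print("built with CUDA:", torch.version.cuda)
print("cuda available:", torch.cuda.is_available())

# nvidia-smi reports the installed driver and the highest CUDA version
# that driver supports; a driver older than torch.version.cuda would
# explain is_available() returning False.
try:
    print(subprocess.check_output(["nvidia-smi"], text=True))
except (FileNotFoundError, subprocess.CalledProcessError) as exc:
    print("could not run nvidia-smi:", exc)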

Does anyone know of a solution to this problem?

Hi @tanahhh,

Could you please share more about your Space? Are you using the Docker or Gradio SDK?
cc @chris-rannou, maybe a recent internal infra change has an impact on this?

@radames

We don’t use Docker.
Our project has just a requirements.txt and an app.py; the contents of each are below.

requirements.txt:

accelerate
protobuf
sentencepiece
torch>=2.0.1
pillow
transformers
app.py:

import gradio as gr
import torch

if __name__ == "__main__":
    device = torch.device("cuda" if torch.cuda.is_available() else "cpu")
    print(device, flush=True)
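
On the A10 hardware we expect this to print cuda, but it currently prints cpu. A variant like the sketch below (our assumption about how to surface the problem, since torch.cuda.is_available() swallows the underlying error) may expose the actual driver failure:

import torch

# Sketch: force CUDA initialization so the underlying failure
# (e.g. an insufficient driver version) is raised as an exception
# rather than being reduced to a bare False.
try:
    torch.cuda.init()
    print("CUDA OK:", torch.cuda.get_device_name(0))
except Exception as exc:  # RuntimeError on driver problems
    print("CUDA init failed:", exc)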