I’m getting the following error in my logs when I click on Train:
FATAL ERROR: NVIDIA Management Library (NVML) not found.
HINT: The NVIDIA Management Library ships with the NVIDIA display driver (available at
https://www.nvidia.com/Download/index.aspx), or can be downloaded as part of the
NVIDIA CUDA Toolkit (available at https://developer.nvidia.com/cuda-downloads).
The lists of OS platforms and NVIDIA-GPUs supported by the NVML library can be
found in the NVML API Reference at https://docs.nvidia.com/deploy/nvml-api.
I’ve already added my HF_TOKEN, and tried:
Restarting the Space
Factory Reboot
Why is this error occurring? What can I do to fix this error?
For context, I’m creating this project to teach young students on how to leverage the no-code interface on Hugging Face and Teachable Machine Learning.
I also got this error when I tried autotrain app
Your installed package nvidia-ml-py is corrupted. Skip patch functions nvmlDeviceGet{Compute,Graphics,MPSCompute}RunningProcesses. You may get incorrect or incomplete results. Please consider reinstall package nvidia-ml-py via pip3 install --force-reinstall nvidia-ml-py nvitop.
Your installed package nvidia-ml-py is corrupted. Skip patch functions nvmlDeviceGetMemoryInfo. You may get incorrect or incomplete results. Please consider reinstall package nvidia-ml-py via pip3 install --force-reinstall nvidia-ml-py nvitop.
/Users/temme/Documents/pogo/framework-validator/autotrain/lib/python3.10/site-packages/bitsandbytes/cextension.py:34: UserWarning: The installed version of bitsandbytes was compiled without GPU support. 8-bit optimizers, 8-bit multiplication, and GPU quantization are unavailable.
warn("The installed version of bitsandbytes was compiled without GPU support. "
‘NoneType’ object has no attribute ‘cadam32bit_grad_fp32’