Hugging Face Forums
RuntimeError: CUDA error: CUBLAS_STATUS_EXECUTION_FAILED when calling `cublasGemmEx( handle, opa, opb, m, n, k, &falpha, a, CUDA_R_16F, lda, b, CUDA_R_16F, ldb, &fbeta, c, CUDA_R_16F, ldc, CUDA_R_32F, CUBLAS_GEMM_DEFAULT_TENSOR_OP)
🤗Transformers
nielsr
September 30, 2024, 1:36pm
3
Hi,
Would recommend the following:
Training Model on CPU instead of GPU - #2 by sgugger
.
1 Like
show post in topic
Related topics
Topic
Replies
Views
Activity
CUDA Runtime Error in the Middle of Training
Intermediate
1
771
March 30, 2024
RuntimeError: Expected all tensors to be on the same device, but found at least two devices, cuda:2 and cuda:0! (when checking argument for argument index in method wrapper_CUDA__index_select)
DeepSpeed
5
3196
August 26, 2024
cuBLAS error 13 when running code with langchain.llms on GPU
🤗Accelerate
0
237
May 6, 2024
RoBERTa fine-tuning, CUBLAS_STATUS_NOT_SUPPORTED
Beginners
0
954
December 20, 2022
Getting CUDA out of memory when calling save_pretrained in a script that tries lora training a large language model
Beginners
3
1690
November 9, 2023