Can't run inference with the trained model due to a CUDA problem

I have fine-tuned a model on a single GPU by setting

import torch

device = torch.device("cuda:6" if torch.cuda.is_available() else "cpu")
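(For reference, a minimal sketch of a more defensive version of this device selection; the index 6 is taken from my snippet above, and the fallback order is just one possible choice:)

```python
import torch

# GPU index used during training (cuda:6 in my setup)
PREFERRED_GPU = 6

# Only request cuda:6 if CUDA is available AND the machine actually
# exposes at least PREFERRED_GPU + 1 devices; otherwise fall back.
if torch.cuda.is_available() and PREFERRED_GPU < torch.cuda.device_count():
    device = torch.device(f"cuda:{PREFERRED_GPU}")
elif torch.cuda.is_available():
    device = torch.device("cuda:0")
else:
    device = torch.device("cpu")

print(device)
```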

but when I run inference, I get the following error:

Traceback (most recent call last):
  File "/TransformerModels/", line 237, in <module>
    inputs = tokenizer(smiles, return_tensors="pt", padding="max_length", truncation=True, max_length=250).to(device)  # max_length=195
  File "anaconda3/envs/lm_hugg/lib/python3.9/site-packages/transformers/", line 759, in to
    self.data = {k: v.to(device=device) for k, v in self.data.items()}
  File "anaconda3/envs/lm_hugg/lib/python3.9/site-packages/transformers/", line 759, in <dictcomp>
    self.data = {k: v.to(device=device) for k, v in self.data.items()}
RuntimeError: CUDA error: invalid device ordinal
CUDA kernel errors might be asynchronously reported at some other API call, so the stacktrace below might be incorrect.
For debugging consider passing CUDA_LAUNCH_BLOCKING=1.
Compile with TORCH_USE_CUDA_DSA to enable device-side assertions.
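(For context, "invalid device ordinal" is raised when the requested GPU index is not visible to the process, e.g. the inference machine has fewer GPUs than index 6, or `CUDA_VISIBLE_DEVICES` hides some of them. A quick diagnostic I can run, assuming only PyTorch is installed:)

```python
import os
import torch

# How many CUDA devices does this process actually see?
n = torch.cuda.device_count()
print(f"CUDA available: {torch.cuda.is_available()}, visible devices: {n}")
print(f"CUDA_VISIBLE_DEVICES = {os.environ.get('CUDA_VISIBLE_DEVICES')!r}")
# cuda:6 is only valid if n >= 7; otherwise .to(device) raises
# "RuntimeError: CUDA error: invalid device ordinal".
```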

How can I solve this problem?

Thank you