Tensors not on the same device after using BitsAndBytesConfig

Hello, I am a beginner with LLMs and I was trying to use QLoRA to fine-tune the llama3-7B model. I am working on a text classification problem, so I load the model with `AutoModelForSequenceClassification.from_pretrained(model_name, quantization_config=bnb_config, num_labels=100)`, and I use DeepSpeed ZeRO-2 to reduce the memory used on each GPU. When I remove the `quantization_config` argument, the code runs perfectly, but when I add it back, I get an error like: `RuntimeError: Expected all tensors to be on the same device`. Does anyone know what is going on here? Thanks for your time!
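For reference, here is roughly how I load the model. This is a minimal sketch: the exact `bnb_config` values shown are a typical 4-bit QLoRA setup (I am not certain these match every setup), and `model_name` is a placeholder for the actual checkpoint path.

```python
import torch
from transformers import AutoModelForSequenceClassification, BitsAndBytesConfig

# Placeholder for the llama3-7B checkpoint path or hub ID used above
model_name = "path/to/llama3-7b"

# Assumed 4-bit QLoRA-style quantization settings
bnb_config = BitsAndBytesConfig(
    load_in_4bit=True,
    bnb_4bit_quant_type="nf4",
    bnb_4bit_use_double_quant=True,
    bnb_4bit_compute_dtype=torch.bfloat16,
)

# Works without quantization_config; fails with the device error when it is passed
model = AutoModelForSequenceClassification.from_pretrained(
    model_name,
    quantization_config=bnb_config,
    num_labels=100,
)
```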