How to use GPU when using transformers.AutoModel

Jaglinux · February 3, 2024, 10:53pm

import torch
from transformers import AutoModel

device = "cuda:0" if torch.cuda.is_available() else "cpu"
model = AutoModel.from_pretrained("<pre train model>")
# The tokenizer inputs are moved to `device`, but the model is not.
model(<tokenizer inputs>.to(device))

The above code fails when run on a GPU device, with the following error:
return torch.embedding(weight, input, padding_idx, scale_grad_by_freq, sparse)
RuntimeError: Expected all tensors to be on the same device, but found at least two devices, cpu and cuda:0! (when checking argument for argument index in method wrapper_CUDA__index_select)

It fails because the weights of the pretrained model are on the CPU while the input data is on the GPU.
Is there a parameter to pass to AutoModel.from_pretrained() to make it work on the GPU?
https://huggingface.co/transformers/v3.0.2/_modules/transformers/configuration_auto.html#AutoConfig.from_pretrained
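
For what it's worth, the usual way around this kind of device mismatch is to move the model to the same device as the inputs with `.to(device)`. Below is a minimal sketch of that. It is not taken from the original post: the tiny `BertConfig` and the hand-written `input_ids` are hypothetical stand-ins so the snippet runs without downloading a checkpoint or needing a tokenizer. In real code the model would come from `AutoModel.from_pretrained(...)` and the inputs from a tokenizer.

```python
import torch
from transformers import AutoModel, BertConfig

device = torch.device("cuda:0" if torch.cuda.is_available() else "cpu")

# Hypothetical tiny config (an assumption for illustration) so this runs offline;
# in practice this would be AutoModel.from_pretrained(...) with a real checkpoint.
config = BertConfig(
    vocab_size=100,
    hidden_size=32,
    num_hidden_layers=1,
    num_attention_heads=2,
    intermediate_size=64,
)
model = AutoModel.from_config(config)

# Move the model's weights to the target device before calling it.
model = model.to(device)
model.eval()

# Stand-in for tokenizer output: one sequence of four token ids plus its mask.
inputs = {
    "input_ids": torch.tensor([[1, 5, 7, 2]]),
    "attention_mask": torch.ones(1, 4, dtype=torch.long),
}
# Move every input tensor to the same device as the model.
inputs = {name: tensor.to(device) for name, tensor in inputs.items()}

with torch.no_grad():
    outputs = model(**inputs)

print(outputs.last_hidden_state.shape)
```

With a real tokenizer, the returned batch can be moved in one call, e.g. `tokenizer(text, return_tensors="pt").to(device)`. As for a parameter on `from_pretrained()` itself: recent versions of transformers accept a `device_map` argument, but as far as I know that path relies on the `accelerate` package being installed, so treat it as something to check against the docs for your version.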
