Is there a way to automatically infer the device of the model when using device_map='auto', and move the input (prompt IDs) to it?
Here’s what I have now:
DEVICE = "cuda" if torch.cuda.is_available() else "mps" if torch.backends.mps.is_available() else "cpu"
tokenizer = transformers.AutoTokenizer.from_pretrained(<model_id>)
model = transformers.AutoModelForCausalLM.from_pretrained(<model_id>, device_map="auto")
prompt = "Hello how are you"
prompt_obj = tokenizer(prompt, return_tensors="pt").to(DEVICE)
# proceed with model.generate
Instead of hardcoding DEVICE, I'd like to infer it from the model's device map. Something like:
# inferred_device = <some code that maybe involves model.hf_device_map>
prompt_obj = tokenizer(prompt, return_tensors="pt").to(inferred_device)
Is there a way to do this?
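The closest I've found is reading the device of the model's first parameter, which works for a toy module, but I'm not sure it's the right thing to do when hf_device_map shards the model across several devices (the snippet below uses a plain nn.Linear as a stand-in so it runs anywhere):

```python
import torch
import torch.nn as nn

# Stand-in for the transformers model; with device_map="auto" the real
# model's first parameter is typically the input embedding.
model = nn.Linear(4, 4)

# Infer the device from the model's first parameter.
inferred_device = next(model.parameters()).device
print(inferred_device.type)
```

Is this pattern reliable with a sharded device map, or is there a supported way to do this?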