When I load a large pretrained model such as T5-XXL with `device_map="auto"` and `torch_dtype=torch.float16`, Accelerate always insists on including my CPU, even though I have enough GPU RAM (48 GB). How do I constrain Accelerate to use only my GPUs? I tried setting the `device_map` manually, spreading the layers over the GPUs, but that raised an error.