device_map="auto"

It loads a model across multiple GPUs. Once loaded, the model can be run forward or backward. So far I have only used "auto" for training, and it works.
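
For reference, a minimal sketch of the setup I mean (the checkpoint name is just a placeholder; any causal LM should behave the same, and device_map="auto" requires `accelerate` to be installed):

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

# "facebook/opt-1.3b" is only a placeholder checkpoint.
# With device_map="auto", Accelerate decides which GPU (or CPU/disk,
# if VRAM runs out) each layer is placed on.
model = AutoModelForCausalLM.from_pretrained(
    "facebook/opt-1.3b",
    device_map="auto",
    torch_dtype=torch.float16,
)
tokenizer = AutoTokenizer.from_pretrained("facebook/opt-1.3b")

# Inputs only need to be on the device of the first shard (GPU 0 here).
inputs = tokenizer("Hello world", return_tensors="pt").to(0)
outputs = model(**inputs, labels=inputs["input_ids"])
outputs.loss.backward()  # works for me when the model fits entirely on GPUs
```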

As for the section you are referring to:

> This only supports the inference of your model, not training. Most of the computation happens behind torch.no_grad() context managers to avoid spending some GPU memory with intermediate activations.

I think that restriction only applies to the CPU/disk offloading mechanism, not to the case where the full model fits across several GPUs.
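
One way to check which case you are in (a sketch, assuming the model object from above): models loaded with a device_map expose an hf_device_map attribute, and offloading is only in play if it contains "cpu" or "disk" entries.

```python
# Inspect where each submodule was placed; values are GPU indices,
# "cpu", or "disk".
print(model.hf_device_map)

# If nothing was offloaded, every parameter sits on a GPU and, in my
# experience, backward works as usual.
offloaded = any(d in ("cpu", "disk") for d in model.hf_device_map.values())
print("offloaded:", offloaded)
```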