Device_map="auto"

Kenkentron · August 5, 2023, 7:50pm

I was wondering if this parameter is only set for inference as the document states (Handling big models for inference) or does it actually have an effect during training?

Thanks!

hansekbrand · August 6, 2023, 5:49am

It loads a model onto multiple GPUs. Once loaded, the model can be run forward or backward. I have only used ”auto” for training as of yet, and it works.

If you refer to this section:

This only supports the inference of your model, not training. Most of the computation happens behind torch.no_grad() context managers to avoid spending some GPU memory with intermediate activations.

I think it only applies to the offloading to CPU or disk mechanism, but not when the full model can be loaded onto several GPUs.

Kenkentron · August 7, 2023, 2:51pm

Thanks @hansekbrand that is helpful.

With more search, I think it became clear to me that device_map=“auto” is doing naive MP for training (ref: Make all Transformer models compatible with model parallelism · Issue #22561 · huggingface/transformers · GitHub).

muellerzr · August 7, 2023, 2:53pm

Correct, any form of distributed training aside MP is not supported, and as of the next version will raise a proper error if you try to do so:

Kenkentron · August 9, 2023, 5:04pm

Thanks @muellerzr, could you also take a look at a related problem I have ZeRO uses more RAM than DDP??

vergilus · September 25, 2024, 9:27am

training mode does not support device_map=“auto”, it will throw an error suggesting you load model on single device.

Topic		Replies	Views
Using device_map='auto' for training 🤗Accelerate	5	37691	January 24, 2025
Infer_auto_device_map returns empty 🤗Accelerate	2	3328	March 15, 2023
How to load model on multiple GPUs for inference? Beginners	0	763	September 28, 2023
Trainer API for Model Parallelism using AutoModelForQuestionAnswering 🤗Transformers	1	160	June 5, 2024
Would PyTorch's FSDP work with a model loaded using device_map='auto'? 🤗Transformers	0	272	April 17, 2024

Device_map="auto"

Related topics