Would PyTorch's FSDP work with a model loaded using device_map='auto'?

ChristopherTan · April 17, 2024, 3:24am

I'm guessing no, because FSDP distributes the model over the GPUs in a very specific way and device_map='auto' might not align with that. Is my understanding correct? Is that why training usually does not work with device_map='auto'?
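
To make the suspected mismatch concrete, here is a toy sketch in plain Python (no torch, no real GPUs). It only contrasts the two layouts as I understand them: device_map='auto' places whole layers on individual devices from a single process, while FSDP runs one process per GPU and gives every rank a slice of every wrapped layer. The layer names, sizes, and both helper functions are made up for illustration, and the even memory budget is a simplifying assumption, not how either library actually computes placement.

```python
# Toy illustration only: layer names and parameter counts are hypothetical.
LAYERS = {"embed": 4, "block0": 8, "block1": 8, "block2": 8, "head": 4}
NUM_GPUS = 2


def layerwise_placement(layers, num_gpus):
    """Rough stand-in for device_map='auto': each whole layer lives on one device.

    Devices are filled in order against a simplistic even budget (assumption).
    """
    budget = sum(layers.values()) / num_gpus
    placement, device, used = {}, 0, 0
    for name, size in layers.items():
        # Move to the next device once the current one would overflow.
        if used + size > budget and device < num_gpus - 1:
            device, used = device + 1, 0
        placement[name] = device
        used += size
    return placement


def sharded_placement(layers, num_ranks):
    """Rough stand-in for FSDP: every rank holds an equal slice of every layer.

    Sizes here are chosen to divide evenly; real FSDP pads when they do not.
    """
    return {name: [size // num_ranks] * num_ranks for name, size in layers.items()}


if __name__ == "__main__":
    print("layer-wise:", layerwise_placement(LAYERS, NUM_GPUS))
    print("sharded:   ", sharded_placement(LAYERS, NUM_GPUS))
```

In the first layout a given layer exists on exactly one GPU; in the second, no GPU holds a complete layer. That difference is the kind of conflict the question is asking about, though this sketch says nothing about what the real libraries do when the two are combined.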
