I see that PyTorch/XLA FSDP is supported using the Trainer API as described here:
But what if I’m using the accelerate API instead of Trainer? When I run `accelerate config` and select TPU as the platform, I don’t see any of the FSDP options that appear when I select multi-GPU. That seems to imply that accelerate doesn’t currently support FSDP on TPUs. Does that mean that if I use accelerate on a TPU pod, the parallelization strategy is just plain old (non-sharded) data parallelism? That’s a non-starter for large transformer models, since the complete model won’t fit on a single TPU.
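For reference, here is roughly the kind of section `accelerate config` writes out when I pick multi-GPU and enable FSDP (an illustrative excerpt from my own config; exact key names and values may vary by accelerate version). Nothing like the `fsdp_config` block is offered when TPU is selected:

```yaml
# Illustrative excerpt of the YAML that `accelerate config` produces
# for multi-GPU with FSDP enabled (keys may differ between versions).
distributed_type: FSDP
fsdp_config:
  fsdp_sharding_strategy: 1   # FULL_SHARD
  fsdp_auto_wrap_policy: TRANSFORMER_BASED_WRAP
  fsdp_offload_params: false

# Selecting TPU instead yields only something like:
# distributed_type: TPU
# ...with no fsdp_config section offered.
```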
Bottom line: If I want to use FSDP to train on a TPU pod, am I forced to use the Trainer API instead of accelerate?