Hi, I am trying to use FSDP via the HF Trainer.
I am not launching the job with Accelerate because I am using Ray. However, most of the tutorials and documentation (example 1, example 2) assume the job will be launched with Accelerate and therefore demonstrate creating a config with `accelerate config --config_file fsdp_config.yaml`. They show a resulting config that looks something like:
fsdp_config:
  fsdp_auto_wrap_policy: TRANSFORMER_BASED_WRAP
  fsdp_backward_prefetch: BACKWARD_PRE
  fsdp_cpu_ram_efficient_loading: true
  fsdp_forward_prefetch: false
  fsdp_offload_params: false
  fsdp_sharding_strategy: FULL_SHARD
  fsdp_state_dict_type: SHARDED_STATE_DICT
  fsdp_sync_module_states: true
  fsdp_use_orig_params: false
I am trying to create a comparable config via the Trainer's arguments `fsdp` and `fsdp_config`. I am able to line up most of the arguments from Accelerate's `fsdp_config.yaml`, but for others I am less sure (my attempted mapping is sketched after the questions below). I was hoping to get some guidance on the following:
- Does `fsdp_auto_wrap_policy: TRANSFORMER_BASED_WRAP` map to the `"auto_wrap"` option, passed via `TrainingArguments.fsdp`?
- Does `fsdp_offload_params: true` map to the `"offload"` option, passed via `TrainingArguments.fsdp`?
- I don't see how to specify `fsdp_state_dict_type` at all via `TrainingArguments`… does it default to `SHARDED_STATE_DICT`?
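For reference, here is roughly what I have so far. The values in the `fsdp` string and the keys in the `fsdp_config` dict are my best guesses from the `TrainingArguments` docstring, so they may not be the correct mapping:

```python
from transformers import TrainingArguments

# My attempted translation of the Accelerate YAML above into Trainer arguments.
# The fsdp string and the fsdp_config keys are guesses -- corrections welcome.
training_args = TrainingArguments(
    output_dir="./output",
    # "full_shard" for fsdp_sharding_strategy: FULL_SHARD, and (I assume)
    # "auto_wrap" for fsdp_auto_wrap_policy: TRANSFORMER_BASED_WRAP.
    # Would adding "offload" here be the equivalent of fsdp_offload_params: true?
    fsdp="full_shard auto_wrap",
    fsdp_config={
        "backward_prefetch": "backward_pre",
        "forward_prefetch": False,
        "cpu_ram_efficient_loading": True,
        "sync_module_states": True,
        "use_orig_params": False,
        # Is this also needed for transformer-based wrapping, or is it inferred
        # from the model? ("LlamaDecoderLayer" is just a placeholder here.)
        # "transformer_layer_cls_to_wrap": "LlamaDecoderLayer",
        # I don't see a key for fsdp_state_dict_type, hence my last question above.
    },
)
```

Since I am launching with Ray rather than `accelerate launch`, this `TrainingArguments` object is what gets built inside each Ray worker and handed to the `Trainer`.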