How to Use MS-Swift to Load a Model onto Multiple GPUs for Full Parameter Fine-Tuning?

Hello everyone,

I’m currently working on fine-tuning a large model using ms-swift, and I need to load the model onto multiple GPUs for efficient training. I’ve been looking through the available documentation and code, but I haven’t found clear examples or explanations on how to achieve this.

I would greatly appreciate it if anyone could share their experience or direct me to any relevant documentation or examples on how to configure and run ms-swift with multiple GPUs.

1 Like

It seems that several methods like FSDP are available by default?