How to Use MS-Swift to Load a Model onto Multiple GPUs for Full Parameter Fine-Tuning?

stringofu · December 2, 2025, 9:35am

Hello everyone,

I’m currently working on fine-tuning a large model using ms-swift, and I need to load the model onto multiple GPUs for efficient training. I’ve been looking through the available documentation and code, but I haven’t found clear examples or explanations on how to achieve this.

I would greatly appreciate it if anyone could share their experience or direct me to any relevant documentation or examples on how to configure and run ms-swift with multiple GPUs.

John6666 · December 2, 2025, 11:57am

It seems that several distributed training methods, such as FSDP, are available by default?
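
For a sense of why sharding methods such as FSDP matter for full-parameter fine-tuning, here is a rough back-of-the-envelope sketch. It is not ms-swift code, and the function name is made up for illustration. It assumes mixed-precision training with Adam, which is commonly counted as about 16 bytes per parameter (2 for half-precision weights, 2 for gradients, 12 for the fp32 master weights and the two Adam moments), and it assumes those states are sharded evenly across GPUs. Activations, buffers, and fragmentation are ignored, so real usage will be higher.

```python
# Rough estimate of per-GPU memory for model states under full sharding.
# Assumption: mixed-precision Adam, ~16 bytes per parameter in total.
HALF_WEIGHT_BYTES = 2      # fp16/bf16 copy of the weights
HALF_GRAD_BYTES = 2        # fp16/bf16 gradients
OPTIMIZER_BYTES = 12       # fp32 master weights + Adam momentum + Adam variance
BYTES_PER_PARAM = HALF_WEIGHT_BYTES + HALF_GRAD_BYTES + OPTIMIZER_BYTES


def model_state_gb(num_params: int, num_gpus: int = 1) -> float:
    """Return approximate GB (10^9 bytes) of model states held by each GPU.

    Hypothetical helper: assumes weights, gradients, and optimizer states
    are all split evenly across `num_gpus`. Activations are not included.
    """
    if num_gpus < 1:
        raise ValueError("num_gpus must be at least 1")
    return num_params * BYTES_PER_PARAM / num_gpus / 1e9


if __name__ == "__main__":
    params = 7_000_000_000  # example: a 7B-parameter model
    for gpus in (1, 2, 4, 8):
        print(f"{gpus} GPU(s): ~{model_state_gb(params, gpus):.1f} GB per GPU")
```

Under these assumptions a 7B model needs roughly 112 GB for model states alone, which does not fit on a single typical GPU, while sharding across 8 GPUs brings that down to about 14 GB each before activations.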
