Trainer API for Model Parallelism on Multiple GPUs

zcakzhu · August 4, 2023, 2:38pm

Thanks for your reply! It is super helpful. It is great to know that by just running python {myscript.py}, the class will use model parallelism.

A follow-up question from me: how does the Trainer’s model parallelism differ from DeepSpeed and FSDP? Is there any documentation I can read to learn more about what is happening on the backend?

Thanks a lot!
