I am looking to fine-tune a pipeline consisting of a stack of models (say A, B, and C), one of which (B) is an LLM (GPT-J) that occupies a lot of memory.
I want to split model B across 2 GPUs and place the other models in the pipeline on the same GPUs, in order (i.e. A and the first half of B on GPU_0; the second half of B and C on GPU_1).
Does DeepSpeed allow manual model parallelization, i.e. can I decide which model goes on what GPU? If not, what would be the best way to achieve pipeline parallelization with a split model?
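To make the intended placement concrete, here is a rough plain-PyTorch sketch of what I mean by manual model parallelism. The `nn.Linear` layers are just hypothetical stand-ins for A, the two halves of B, and C; the real question is whether DeepSpeed lets me express this kind of explicit device assignment.

```python
import torch
import torch.nn as nn

# Fall back to CPU when 2 GPUs are not available, so the sketch still runs.
dev0 = torch.device("cuda:0" if torch.cuda.device_count() >= 2 else "cpu")
dev1 = torch.device("cuda:1" if torch.cuda.device_count() >= 2 else "cpu")

class ManualPipeline(nn.Module):
    def __init__(self):
        super().__init__()
        self.A = nn.Linear(16, 16).to(dev0)        # model A on GPU_0
        self.B_first = nn.Linear(16, 16).to(dev0)  # first half of B on GPU_0
        self.B_second = nn.Linear(16, 16).to(dev1) # second half of B on GPU_1
        self.C = nn.Linear(16, 4).to(dev1)         # model C on GPU_1

    def forward(self, x):
        x = self.B_first(self.A(x.to(dev0)))
        x = x.to(dev1)                             # activations cross GPUs here
        return self.C(self.B_second(x))

out = ManualPipeline()(torch.randn(2, 16))
```

This is naive (no micro-batching, the GPUs idle while waiting on each other), which is why I'm asking whether DeepSpeed's pipeline parallelism can take over once the per-device assignment is fixed.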