How to make a huge LM fit on multiple GPUs?

gksdnrwp · July 20, 2022, 9:11am

I want to implement pipeline parallelism for huge LMs (hosted on the Hugging Face Hub) that do not fit on a single GPU. I tried DeepSpeed, but an LM from transformers cannot be pipelined as-is because it is not a sequential model.
So is there an API that converts a transformers model into a sequential one?
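
To illustrate why a sequential layer list matters here: pipeline parallelism places contiguous runs of layers on each GPU, so the model first has to be expressible as an ordered list of layers. Below is a minimal sketch in plain Python of that splitting step. The layer names and parameter counts are hypothetical, `partition_layers` is an illustrative helper rather than a DeepSpeed or transformers function, and the greedy split is a simplification, not how any particular library partitions a model.

```python
def partition_layers(param_counts, num_stages):
    """Split an ordered list of per-layer parameter counts into
    contiguous stages with roughly equal total size.

    Returns a list of stages, each a list of layer indices.
    Hypothetical helper for illustration only.
    """
    n = len(param_counts)
    if not 1 <= num_stages <= n:
        raise ValueError("num_stages must be between 1 and the number of layers")

    total = sum(param_counts)
    stages, current, running = [], [], 0
    for i, count in enumerate(param_counts):
        current.append(i)
        running += count
        remaining_layers = n - i - 1
        remaining_stages = num_stages - len(stages) - 1
        # Cumulative size at which the current stage should end.
        target = total * (len(stages) + 1) / num_stages
        # Close the stage once it reaches its target, or when every
        # remaining stage needs one of the remaining layers.
        if remaining_stages > 0 and (
            running >= target or remaining_layers == remaining_stages
        ):
            stages.append(current)
            current = []
    stages.append(current)
    return stages


# Hypothetical model: an embedding, four transformer blocks, an output head.
layer_names = ["embed", "block0", "block1", "block2", "block3", "lm_head"]
layer_params = [10, 20, 20, 20, 20, 10]  # made-up sizes, in millions

for gpu, stage in enumerate(partition_layers(layer_params, 2)):
    print(f"GPU {gpu}: {[layer_names[i] for i in stage]}")
```

A split like this only works when each layer's output feeds directly into the next layer, which is the property a model with a custom `forward` does not expose on its own.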

Or is there a better way to make huge LMs fit across multiple GPUs?

Thanks in advance.
