Which method does the HF Trainer use with multiple GPUs?

According to the following question, the Trainer will handle multi-GPU training on its own. Which method does it use?

DataParallel (DP), TensorParallel (TP), PipelineParallel (PP), or DistributedDataParallel (DDP)?

Based on this line of code, it looks like it is using nn.DataParallel; however, I haven't gone through every line of the Trainer class, so it may also be using other methods at other points.
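
For reference, here is a minimal sketch of that pattern (the stand-in model is just for illustration, not the Trainer's actual source):

```python
import torch
from torch import nn

# The pattern in question: when more than one GPU is visible and the
# script was not launched with a distributed launcher, the model gets
# wrapped in nn.DataParallel (single process, batch split across GPUs).
model = nn.Linear(10, 2)  # stand-in for a real model

if torch.cuda.device_count() > 1:
    model = nn.DataParallel(model)

model.to("cuda" if torch.cuda.is_available() else "cpu")
```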

Thank you for the information. The old Trainer documentation required configuring this, but the new documentation doesn't mention it. Old doc: Trainer — transformers 4.7.0 documentation
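
If it helps, you can check what the Trainer will do in a given environment without configuring anything by hand; `TrainingArguments` exposes this (the `output_dir` value is just a placeholder):

```python
from transformers import TrainingArguments

args = TrainingArguments(output_dir="out")  # placeholder output dir

# parallel_mode reports how the Trainer will parallelize here:
# NOT_DISTRIBUTED -> nn.DataParallel over all visible GPUs
#                    (plain `python train.py` on a multi-GPU machine)
# DISTRIBUTED     -> DistributedDataParallel, one process per GPU
print(args.parallel_mode, args.n_gpu)
```

No flag is needed in the training script itself: running it with plain `python` on a multi-GPU machine gives DP, while launching the same script with `torchrun --nproc_per_node=2 train.py` gives DDP.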

Hi @AndreaSottana, sorry to bother you. I am trying to fine-tune GPT-Neo, and because of a CUDA out-of-memory issue I need to use multiple GPUs. I am using the Hugging Face Trainer, which I understand will use multiple GPUs, but my results are very strange and very different from when I use a single GPU. Could you please help me with how you use multiple GPUs for fine-tuning the model?

Many thanks
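
One thing worth checking (a common cause of this, not a diagnosis of your exact setup): with `nn.DataParallel` the effective batch size is `per_device_train_batch_size` times the number of GPUs, so the same arguments do not train identically on 1 vs. 2 GPUs unless you compensate. A sketch with placeholder values:

```python
from transformers import TrainingArguments

args = TrainingArguments(
    output_dir="out",                # placeholder
    per_device_train_batch_size=4,   # 4 on 1 GPU, effectively 8 on 2 GPUs
    gradient_accumulation_steps=1,
    learning_rate=5e-5,              # may need retuning for the larger batch
)

# train_batch_size already accounts for the number of visible GPUs
print("effective per-step batch size:", args.train_batch_size)
```

Halving `per_device_train_batch_size` (or adjusting the learning rate) is a common way to make multi-GPU runs more comparable to single-GPU ones.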
