Training using multiple GPUs

sgugger October 21, 2020, 4:48pm 8

The Trainer lets you compute the loss however you want: subclass it and override compute_loss (see an example here). By default we use the loss returned by the model, since that covers the use case of most users.
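For anyone landing here later, a minimal sketch of what overriding compute_loss can look like. The subclass name `WeightedLossTrainer`, the `CLASS_WEIGHTS` values, and the two-label setup are made up for illustration; only `Trainer` and `compute_loss` come from the library. It assumes the model returns a dict-like output with a "logits" entry and that the batch carries a "labels" key. The `**kwargs` is there because newer versions of the library pass extra arguments to this method.

```python
import torch
from torch import nn
from transformers import Trainer

# Illustrative per-class weights for a 2-label problem (an assumption, not from the thread).
CLASS_WEIGHTS = torch.tensor([1.0, 2.0])


class WeightedLossTrainer(Trainer):
    """Hypothetical subclass that replaces the default loss with weighted cross-entropy."""

    def compute_loss(self, model, inputs, return_outputs=False, **kwargs):
        # Separate the labels from the model inputs without mutating the batch.
        labels = inputs["labels"]
        model_inputs = {k: v for k, v in inputs.items() if k != "labels"}

        outputs = model(**model_inputs)
        logits = outputs["logits"]

        # Take the device from the logits rather than the model: when the model is
        # wrapped for multi-GPU training, the wrapper may not expose a .device attribute.
        loss_fct = nn.CrossEntropyLoss(weight=CLASS_WEIGHTS.to(logits.device))
        loss = loss_fct(logits.view(-1, logits.size(-1)), labels.view(-1))

        return (loss, outputs) if return_outputs else loss
```

You would then build `WeightedLossTrainer(model=..., args=..., train_dataset=...)` exactly as you would a regular Trainer; the rest of the training loop is unchanged.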

