I was wondering what happened to the tips/tricks recommended on "Performance and Scalability: How To Fit a Bigger Model and Train It Faster". It has been replaced by a more comprehensive page on Performance and Scalability, but I couldn't find the same tricks there:
- adafactor
- 8bit adam
- gradient checkpointing
- accumulate gradients
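For context, here is how I understood those four tricks map to `Trainer` settings. This is just a sketch of the keyword arguments as I remember them from recent docs (flag names like `optim="adafactor"` and `"adamw_bnb_8bit"` are my assumption and may differ across versions):

```python
# Sketch: the four tricks expressed as TrainingArguments keyword arguments.
# Flag names are assumed from recent transformers versions and may differ in older ones.
training_kwargs = dict(
    output_dir="out",
    optim="adafactor",              # Adafactor optimizer; "adamw_bnb_8bit" would select 8-bit Adam (needs bitsandbytes)
    gradient_checkpointing=True,    # trade extra compute for lower activation memory
    gradient_accumulation_steps=4,  # accumulate gradients over 4 micro-batches before each optimizer step
    per_device_train_batch_size=2,  # effective batch size = 2 * 4 = 8 per device
)

# These would then be passed straight to TrainingArguments, e.g.:
# from transformers import TrainingArguments
# args = TrainingArguments(**training_kwargs)
print(sorted(training_kwargs))
```

Is this roughly how these are meant to be enabled now, or has the recommended approach changed?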
Are the tips/tricks from "Performance and Scalability: How To Fit a Bigger Model and Train It Faster" still relevant with the latest version of transformers? Or have they been moved to another location on the main branch site?