What is "scheduled LR warm-up"?

tsei902 · March 25, 2023, 6:27pm

Hi everyone!

I am optimizing the finetuning of my T5 model and come across this entry on the HF website (Optimization)

where it is recommended to “use scheduled LR warm-up” with Adafactor when training T5. What is that and how can I implement the latter? Did anyone do this before?

Many thanks in advance.

Topic		Replies	Views
How to use AdaFactor on TPU? Beginners	0	342	August 19, 2021
How is the AdafactorScheluder suppose to be used? Models	5	4029	January 8, 2024
T5 training with Trainer, w/ AdaFactor 🤗Transformers	0	955	February 12, 2023
T5 models, when or why convert to HF? Beginners	0	272	March 7, 2023
T5 Finetuning Tips Models	48	56625	November 3, 2024

What is "scheduled LR warm-up"?

Related topics