Linear Learning Rate Warmup with step-decay

Hey @adaptivedecay, you can define your own learning rate scheduler by subclassing `Trainer` and overriding the `create_scheduler` method to include your logic (see the Trainer section of the transformers 4.5.0.dev0 documentation).
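
Here is a minimal sketch of that approach, assuming the `create_scheduler(self, num_training_steps, optimizer=None)` signature (older versions only pass `num_training_steps`). The `decay_every` and `decay_rate` values are hypothetical placeholders for your own step-decay policy:

```python
import torch
from transformers import Trainer


class WarmupStepDecayTrainer(Trainer):
    def create_scheduler(self, num_training_steps: int, optimizer: torch.optim.Optimizer = None):
        optimizer = optimizer if optimizer is not None else self.optimizer
        warmup_steps = self.args.warmup_steps  # linear warmup length from TrainingArguments
        decay_every = 1_000                    # hypothetical: steps between decays
        decay_rate = 0.5                       # hypothetical: multiplicative decay factor

        def lr_lambda(current_step: int) -> float:
            if current_step < warmup_steps:
                # Linear warmup from 0 up to the base learning rate
                return float(current_step) / float(max(1, warmup_steps))
            # Step decay after warmup: multiply by decay_rate every decay_every steps
            return decay_rate ** ((current_step - warmup_steps) // decay_every)

        self.lr_scheduler = torch.optim.lr_scheduler.LambdaLR(optimizer, lr_lambda)
        return self.lr_scheduler
```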

Alternatively, you can create the optimizer and scheduler yourself and pass them as a tuple via the `optimizers` argument of `Trainer`, as sketched below.
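
A minimal sketch of that second route; `model` and `train_dataset` stand in for your own model and dataset, and the warmup/decay numbers are illustrative:

```python
import torch
from transformers import Trainer, TrainingArguments

# model and train_dataset are placeholders for your own training setup
args = TrainingArguments(output_dir="out", max_steps=10_000)

optimizer = torch.optim.AdamW(model.parameters(), lr=5e-5)

# Linear warmup for 500 steps, then halve the LR every 1_000 steps (illustrative values)
def lr_lambda(step: int) -> float:
    if step < 500:
        return step / 500
    return 0.5 ** ((step - 500) // 1_000)

scheduler = torch.optim.lr_scheduler.LambdaLR(optimizer, lr_lambda)

trainer = Trainer(
    model=model,
    args=args,
    train_dataset=train_dataset,
    optimizers=(optimizer, scheduler),  # Trainer uses these instead of creating its own
)
trainer.train()
```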
