Does starting training from a previous checkpoint reset the learning rate?

md1630 · August 14, 2021, 10:02pm

Hi,

I want to start training a new model by loading a previous model I trained, I want to know what happens to the learning rate in this case – does it start at the learning rate I set, or does it start from the prev learning rate of the checkpoint?

ugaray96 · January 20, 2022, 10:13am

Hi!

As far as I have experienced, it continues from the last learning rate number saved in the checkpoint from where you resumed.
It also starts from the last epoch number.

jbmaxwell · August 20, 2022, 11:17pm

I’m sure you’ve long since realized this (or maybe never had the problem), but I thought I’d mention it for the benefit of anyone else with questions about resuming.

I embarrassingly failed to give the checkpoint to both my model and my trainer, and only this morning realized that it won’t resume correctly this way. At the very least it will fail to continue from the correct training epoch and step, but I think it also impacts the scheduler in more subtle ways, since I got significantly poorer training results when loading the checkpoint only to the model, even though I manually set the learning rate (i.e., to the one stored in the checkpoint). Not sure exactly why this is… (non-linear learning rate decay, maybe?)

Anyway, hopefully this helps someone somewhere sometime.

Topic		Replies	Views
Resume Training with Lower Learning Rate Beginners	3	1330	January 5, 2025
Learning rate and checkpoints 🤗Transformers	0	436	March 29, 2022
Impact of resuming from a checkpoint vs training/finetuning from the start 🤗Transformers	0	25	September 12, 2024
Cannot Resume Training Beginners	1	1375	December 15, 2020
Does "resume_from_checkpoint" work? Beginners	0	968	June 19, 2022

Does starting training from a previous checkpoint reset the learning rate?

Related topics