Linear learning rate despite lr_scheduler_type="polynomial"

kaankork · September 1, 2021, 4:07pm

Hello,

While fine-tuning my network, I would like to set up a polynomial learning rate scheduler by setting lr_scheduler_type="polynomial" and learning_rate=0.00005.

However, when I visualize the learning rate on the wandb dashboard, I’m observing a linear decrease of the learning rate instead of polynomial. (Screenshot below)

What might be causing the issue? I’ve tested without setting the learning_rate and the behavior was exactly the same.

sgugger · September 1, 2021, 7:57pm

The polynomial scheduler is not really supported via this argument: it takes an additional power keyword argument that defaults to 1 and can’t be set via this API. With power left at 1, the polynomial decay is a straight line, which is why your plot looks linear. You should thus set the scheduler directly in the Trainer.
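
To illustrate why a power of 1 gives a straight line, here is a minimal pure-Python sketch of the usual polynomial decay formula. The function name polynomial_lr, the lr_end value of 1e-7, and the step count of 1000 are assumptions made for illustration; this is not the library's code, only a sketch of the same idea.

```python
def polynomial_lr(step, lr_init=5e-5, lr_end=1e-7,
                  num_training_steps=1000, num_warmup_steps=0, power=1.0):
    """Illustrative polynomial decay from lr_init to lr_end (hypothetical helper)."""
    if step < num_warmup_steps:
        # Linear warmup from 0 up to lr_init.
        return lr_init * step / max(1, num_warmup_steps)
    if step > num_training_steps:
        # After the schedule ends, stay at the final learning rate.
        return lr_end
    decay_steps = num_training_steps - num_warmup_steps
    pct_remaining = 1 - (step - num_warmup_steps) / decay_steps
    # With power == 1 this expression is linear in `step`.
    return (lr_init - lr_end) * pct_remaining ** power + lr_end


# Compare the midpoint of the schedule for two exponents.
linear_mid = polynomial_lr(500, power=1.0)
quadratic_mid = polynomial_lr(500, power=2.0)
print(f"power=1: {linear_mid:.4e}  power=2: {quadratic_mid:.4e}")
```

Only an exponent other than 1 bends the curve; with power=2 the learning rate at the midpoint is already well below the halfway value.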

kaankork · September 2, 2021, 12:37pm

@sgugger I previously tested with lr_scheduler_type="polynomial" and lr_scheduler_type="linear" to compare the outcome, and I ended up getting different accuracy results. That makes me wonder: if the polynomial scheduler is not supported via this argument, shouldn’t I get exactly the same results?

Thank you for the info, I was able to set the scheduler directly in the Trainer.
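
For anyone landing here later, a minimal sketch of building the scheduler by hand might look like the following. It assumes torch and transformers are installed; the tiny Linear model, the 1000 training steps, lr_end=1e-7, and power=2.0 are placeholder values chosen for illustration, not the settings used in this thread.

```python
import torch
from transformers import get_polynomial_decay_schedule_with_warmup

# Stand-in model; replace with the model being fine-tuned.
model = torch.nn.Linear(4, 2)
optimizer = torch.optim.AdamW(model.parameters(), lr=5e-5)

num_training_steps = 1000  # assumption: total number of optimizer steps

scheduler = get_polynomial_decay_schedule_with_warmup(
    optimizer,
    num_warmup_steps=0,
    num_training_steps=num_training_steps,
    lr_end=1e-7,
    power=2.0,  # anything other than 1.0 gives a non-linear curve
)

# Record the learning rate at each step to check the shape of the curve.
lrs = []
for _ in range(num_training_steps):
    lrs.append(scheduler.get_last_lr()[0])
    optimizer.step()   # no gradients here, so parameters are left unchanged
    scheduler.step()

print(f"start: {lrs[0]:.4e}  midpoint: {lrs[500]:.4e}  last: {lrs[-1]:.4e}")

# The pair can then be handed to the Trainer instead of lr_scheduler_type:
# trainer = Trainer(model=model, args=training_args, optimizers=(optimizer, scheduler))
```

The commented-out Trainer line relies on the optimizers argument, which takes an (optimizer, scheduler) tuple; training_args there is a placeholder for your own TrainingArguments.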

sgugger · September 2, 2021, 12:38pm

Did you use the exact same seed as well?

kaankork · September 2, 2021, 12:57pm

Yes, the seed was set using set_seed(42).
