I'm using Trainer to handle finetuning a GPT2 model. I see in TrainingArguments there is a max_steps argument that overrides num_train_epochs.
For a batch size of 32, is setting max_steps=1000000 the equivalent of setting num_train_epochs so that the model sees 1,000,000 × 32 = 32,000,000 examples?
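To spell out the arithmetic I'm assuming (the dataset size here is a made-up placeholder, not my actual data):

```python
# Hypothetical numbers to illustrate the steps <-> epochs conversion I have in mind.
dataset_size = 4_000_000   # placeholder; my real dataset differs
batch_size = 32
max_steps = 1_000_000

examples_seen = max_steps * batch_size            # total examples processed over training
steps_per_epoch = dataset_size // batch_size      # optimizer steps in one pass over the data
equivalent_epochs = examples_seen / dataset_size  # how many epochs max_steps corresponds to

print(examples_seen, steps_per_epoch, equivalent_epochs)
```

With these placeholder numbers, 1,000,000 steps would work out to 8 full epochs. Is that the right way to think about it?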
Also, what happens if I have a batch size of 6 but my number of training examples isn't divisible by 6? Does Trainer stop training after the nearest divisible whole number of examples, or does it change the batch size for the final batch?
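To make that second question concrete, here's the arithmetic with hypothetical numbers (100 examples is just for illustration):

```python
import math

# Hypothetical numbers: 100 examples with a batch size of 6 doesn't divide evenly.
dataset_size = 100
batch_size = 6

full_batches = dataset_size // batch_size   # complete batches of 6
remainder = dataset_size % batch_size       # examples left over after the full batches
# If the trailing partial batch is kept, there is one extra, smaller batch:
batches_if_partial_kept = math.ceil(dataset_size / batch_size)

print(full_batches, remainder, batches_if_partial_kept)
```

So with 100 examples I'd get 16 full batches plus 4 leftover examples, and I'm asking whether those 4 are trained on as a smaller batch or skipped.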
Thanks in advance!!