Hi there, newbie question. I'm training my first model (GPT-2) and the trainer pauses every 500 steps to save a checkpoint. My impression is that this slows training down quite a bit. Is that right, or is the overhead usually negligible? And, more importantly, are there community best practices for maximizing training speed while still keeping good logs and checkpoints?
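For context, my setup looks roughly like this. I'm assuming the Hugging Face `Trainer` here since that's what I'm using; the output path and step values are just placeholders:

```python
from transformers import TrainingArguments

# Rough sketch of my current arguments (values are placeholders)
args = TrainingArguments(
    output_dir="gpt2-finetune",  # placeholder output path
    save_strategy="steps",
    save_steps=500,              # this is where the pauses happen
    logging_steps=50,            # how often metrics get logged
)
```

Happy to share more of the script if it helps.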
Thanks for your help,