Training loss increases suddenly at the beginning of each epoch

yisyuan · March 29, 2022, 6:32am

Hi all,
I’ve used Transformers library for a while and tried different models (BERT, BART, ViT, etc.) with provided examples. However, I found that training loss will always increases suddenly at the beginning of each epoch. The figure below is an example:

At first, I thought that is because the training dataset is not shuffled after each epoch. However, there is a related topic indicates that Trainer class should handle this for us.

Does any one experience such a problem? Any comment would be really appreciated!

sgugger · March 29, 2022, 11:50am

Are you sure it’s not the validation loss logged wrongly?

Topic		Replies	Views
The training loss(logging steps) will drop suddenly after each epoch? Help me plz! Orz 🤗Transformers	1	1179	August 26, 2022
Loss behaviour for bert fine-tuning on QNLI Models	3	4414	October 15, 2021
Trainer's step loss always drops sharply after each epoch regardless of model / data 🤗Transformers	3	2159	March 28, 2023
Why my training loss drops at epoch boundaries? Beginners	4	1506	August 22, 2024
Sudden Loss Drop and Poor Performance During Model Training Intermediate	0	51	April 28, 2025

Training loss increases suddenly at the beginning of each epoch

Related topics