I have fine-tuned a BERT model for a binary text classification task. The model achieves an F1 score of ~0.9, but when I plot the training and validation loss, the curves look odd and I am not sure how to interpret them.
Here is the plot:
What is this plot telling me? Is overfitting happening?
The best way to check for overfitting is to also plot your classification performance (e.g. F1) on the validation set over the course of training. If it degrades, the model is overfitting. Otherwise, you may conclude that the validation loss (the negative log-likelihood, NLL) is simply not a very useful predictor of task-specific performance.
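Since you are fine-tuning BERT, a sketch of how this could look: if you are using Hugging Face's `Trainer`, passing a `compute_metrics` function makes it report validation F1 at every evaluation step, which you can then plot next to the loss. The function below is a self-contained version (F1 computed by hand with NumPy so it runs without extra dependencies); the toy logits at the bottom are just illustrative.

```python
import numpy as np

def compute_metrics(eval_pred):
    """Validation F1 for binary classification.

    Matches the (logits, labels) tuple that Hugging Face's
    Trainer passes to compute_metrics at each evaluation.
    """
    logits, labels = eval_pred
    preds = np.argmax(logits, axis=-1)
    tp = np.sum((preds == 1) & (labels == 1))
    fp = np.sum((preds == 1) & (labels == 0))
    fn = np.sum((preds == 0) & (labels == 1))
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    f1 = 2 * precision * recall / (precision + recall) if precision + recall else 0.0
    return {"f1": f1}

# Quick check on toy logits (3 validation examples, 2 classes):
logits = np.array([[0.2, 0.8], [0.9, 0.1], [0.3, 0.7]])
labels = np.array([1, 0, 0])
print(compute_metrics((logits, labels)))  # {'f1': 0.666...}
```

With `Trainer(..., compute_metrics=compute_metrics, eval_strategy="epoch")`, the F1 values appear in the training logs alongside `eval_loss`, so both can be plotted on the same axis.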
Assuming the validation set is large enough and well constructed, you may then conclude that you are not overfitting (otherwise your classifier's validation performance would degrade as training continues), despite what the loss curve suggests. A common cause of this pattern: the model's predicted classes stay the same (so F1 is unchanged) while it becomes increasingly confident on the few examples it gets wrong, which drives the NLL up.
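A small numerical illustration of that last point (the probabilities here are made up, not from your model): two sets of predicted probabilities that make identical classification decisions, and therefore have identical F1, can still have very different log-loss if the mistakes are made with high confidence.

```python
import numpy as np

def f1(y_true, y_pred):
    """Binary F1 from hard predictions."""
    tp = np.sum((y_pred == 1) & (y_true == 1))
    fp = np.sum((y_pred == 1) & (y_true == 0))
    fn = np.sum((y_pred == 0) & (y_true == 1))
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)

def bce(y_true, p):
    """Binary cross-entropy (NLL) from predicted probabilities."""
    return -np.mean(y_true * np.log(p) + (1 - y_true) * np.log(1 - p))

y_true = np.array([1, 1, 1, 0, 0, 1])
# Early epoch: mildly confident, one mistake (the last example).
p_early = np.array([0.7, 0.8, 0.75, 0.3, 0.2, 0.4])
# Later epoch: same decisions, but the mistake is now made with high confidence.
p_late = np.array([0.95, 0.97, 0.96, 0.05, 0.03, 0.02])

for name, p in [("early", p_early), ("late", p_late)]:
    preds = (p >= 0.5).astype(int)
    print(f"{name}: F1 = {f1(y_true, preds):.3f}, NLL = {bce(y_true, p):.3f}")
```

Both epochs give F1 ≈ 0.857, but the "late" NLL is much higher because of the single overconfident error. This is exactly the regime where a rising validation loss coexists with stable task performance.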