Weird losses while fine-tuning


I have a question about fine-tuning a BERT sentiment model. I am working with customer feedback data and want to fine-tune “nlptown/bert-base-multilingual-uncased-sentiment” for my specific use case, since every pretrained model I try shows the same patterns of misclassified sentiments. However, when I fine-tune it I get weird losses like this:
[Screenshot (43): training loss plot with large jumps]

These loss jumps follow the same pattern no matter whether I reduce the learning rate, choose different batch sizes, or change the weight decay. I am working with a Trainer() instance. Nevertheless, when I evaluate the newly trained model on test data, the results look fine. So, finally, my question: can I still use this model? The losses indicate overfitting, don't they?
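For context on why a jumpy curve alone doesn't prove overfitting: Trainer logs the loss averaged over each logging interval, so with small batches the per-step values are inherently noisy even while the underlying trend improves. Here is a minimal, self-contained sketch (plain Python with made-up numbers, not my actual training run) showing how a noisy per-batch loss sequence can still have a clean downward trend once smoothed:

```python
import random

random.seed(0)

# Hypothetical per-step losses: a slow downward trend plus heavy noise,
# roughly like what Trainer prints at each logging step.
raw_losses = [1.0 - 0.003 * step + random.uniform(-0.3, 0.3)
              for step in range(200)]

def moving_average(values, window=20):
    """Smooth a noisy sequence with a simple sliding-window mean."""
    smoothed = []
    for i in range(len(values)):
        chunk = values[max(0, i - window + 1):i + 1]
        smoothed.append(sum(chunk) / len(chunk))
    return smoothed

smoothed = moving_average(raw_losses)

# The raw curve jumps around, but the smoothed trend still decreases.
print(f"first smoothed: {smoothed[0]:.3f}, last smoothed: {smoothed[-1]:.3f}")
```

If the smoothed training loss trends down and the test-set metrics are good, the spikes are usually just batch-to-batch noise rather than overfitting; overfitting would show up as good training loss but degrading evaluation metrics.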