Getting unexpected results from a fine-tuned BERT model

Hi, I am new to the NLP domain and to Hugging Face in particular. I recently learned how to fine-tune models, so I decided to try it out. I fine-tuned the bert-base-cased model on a news sentiment dataset. Everything went well and I got a validation accuracy of around 85%. I also checked the confusion matrix and the model is performing well there.
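
For context, my fine-tuning setup roughly looks like the sketch below. The dataset files, column names, and hyperparameters here are placeholders for what I actually used:

```python
import numpy as np
import evaluate
from datasets import load_dataset
from transformers import (AutoTokenizer, AutoModelForSequenceClassification,
                          TrainingArguments, Trainer)

# Placeholder: my real data is a news sentiment CSV with "text" and "label"
# columns (0 = negative, 1 = positive)
dataset = load_dataset("csv", data_files={"train": "train.csv", "validation": "valid.csv"})

tokenizer = AutoTokenizer.from_pretrained("bert-base-cased")

def tokenize(batch):
    # Tokenize the raw news text for BERT
    return tokenizer(batch["text"], truncation=True, padding="max_length", max_length=128)

tokenized = dataset.map(tokenize, batched=True)

model = AutoModelForSequenceClassification.from_pretrained("bert-base-cased", num_labels=2)

accuracy = evaluate.load("accuracy")

def compute_metrics(eval_pred):
    # Report plain accuracy on the validation split
    logits, labels = eval_pred
    preds = np.argmax(logits, axis=-1)
    return accuracy.compute(predictions=preds, references=labels)

args = TrainingArguments(
    output_dir="bert-news-sentiment",
    num_train_epochs=3,
    per_device_train_batch_size=16,
    evaluation_strategy="epoch",
)

trainer = Trainer(
    model=model,
    args=args,
    train_dataset=tokenized["train"],
    eval_dataset=tokenized["validation"],
    compute_metrics=compute_metrics,
)

trainer.train()
```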

However, when I tried to test it on some outside data (I passed in a few news examples of my own), the model gives unexpected results. For clearly negative news it shows a high positive score, and for positive news it shows a high negative score. I initially thought the target encoding might simply be flipped, but the results turn out to be inconsistent rather than reversed: sometimes the positive news is correctly identified, but most of the time the predictions look essentially random.
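
This is roughly how I am running the model on the outside examples. It is a simplified sketch; the saved model path, the example headlines, and the assumption that index 0 = negative and index 1 = positive are just my guesses at what should be correct:

```python
import torch
from transformers import AutoTokenizer, AutoModelForSequenceClassification

model_dir = "bert-news-sentiment"  # directory where the fine-tuned model was saved
tokenizer = AutoTokenizer.from_pretrained(model_dir)
model = AutoModelForSequenceClassification.from_pretrained(model_dir)
model.eval()

examples = [
    "Company X shares plunge 20% after a fraud investigation is announced.",  # clearly negative
    "Company Y reports record profits and raises its full-year guidance.",    # clearly positive
]

inputs = tokenizer(examples, truncation=True, padding=True, max_length=128, return_tensors="pt")
with torch.no_grad():
    logits = model(**inputs).logits
probs = torch.softmax(logits, dim=-1)

# Assuming index 0 = negative and index 1 = positive; if the training labels
# were encoded the other way round, these scores would look flipped
for text, p in zip(examples, probs):
    print(f"{text}\n  negative={p[0]:.3f}  positive={p[1]:.3f}")
```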

Could someone please explain what the possible reasons behind this are, and what steps I should take to analyse the problem?