Metrics mismatch between BertForSequenceClassification Class and my custom Bert Classification

rgwatwormhill · December 10, 2020, 11:44pm

It’s a good question, but I don’t know the answer, sorry.

(When I tried to add a custom head to a BERT model, I couldn’t get it to learn at all!).

How much different is the accuracy? If it’s only a bit, then it could be just random chance.

When you fine-tune, are you freezing the main BERT layers? I think by default fine-tuning will propagate back into the main layers, which might not be what you want. Not sure that would be any different with the official SequenceClassification head though.

Have you looked at the code that is used for the official SequenceClassification head? This post Which loss function in bertforsequenceclassification regression includes a link to the GitHub page for the code.

Topic		Replies	Views
Weights not downloading Beginners	3	1844	May 24, 2021
Trying to understand XForSequenceClassification heads Intermediate	8	1323	September 24, 2020
How do i take only "BERT" weights from BertForSequenceClassification model? 🤗Transformers	0	1445	February 16, 2022
Further Pretrain Basic BERT for sequence classification 🤗Transformers	4	1810	October 9, 2020
Fine-Tune BERT with two Classification Heads "next to each other"? Beginners	3	2690	September 17, 2021

Metrics mismatch between BertForSequenceClassification Class and my custom Bert Classification

Related topics