How to fine-tune BERT on the STS-B task?

Hi, I am new to NLP and trying to reproduce the fine-tuning results of BERT. However, the STS-B task troubles me. From what I understand, STS-B is a regression task, but BERT seems to treat it as a classification task. I do not quite understand how the similarity scores are transformed into labels. Is anybody willing to give me a hint?

This is all dealt with in the loss function: a model for classification and a model for regression are roughly the same, they just output a different number of labels. Inside the code of `BertForSequenceClassification`, there is a check that picks a different loss function depending on `problem_type`. By default, 1 label (as in STS-B) is treated as regression, so mean-squared error is selected as the loss instead of cross-entropy. The STS-B similarity scores (floats from 0 to 5) are used directly as regression targets against the model's single output logit.

Thank you for your detailed reply, it really helped me :wink: