Which loss function in bertforsequenceclassification regression

rgwatwormhill · October 8, 2020, 8:22pm

BertForSequenceClassification can be used for regression when number of classes is set to 1. The documentation says that BertForSequenceClassification calculates cross-entropy loss for classification. What kind of loss does it return for regression?

(I’ve been assuming it is root mean square error, but I read recently that there are several other possibilities such as Huber or Negative Log Likelihood.)

Which is it?

How should I find out / where is the code?

Karthik12 · October 9, 2020, 8:28am

This is the GitHub link

At line 1354, you have the condition to check the labels (if it is one or more)
if self.num_labels == 1:
# We are doing regression
loss_fct = MSELoss()
loss = loss_fct(logits.view(-1), labels.view(-1))
else:
loss_fct = CrossEntropyLoss()
loss = loss_fct(logits.view(-1, self.num_labels), labels.view(-1))

BramVanroy · October 9, 2020, 9:12am

You can select the lines that you are interested in on a Github code page, then click on the three dots and select copy permalink:

github.com

huggingface/transformers/blob/9aeacb58bab321bc21c24bbdf7a24efdccb1d426/src/transformers/modeling_bert.py#L1353-L1360


if labels is not None:
    if self.num_labels == 1:
        #  We are doing regression
        loss_fct = MSELoss()
        loss = loss_fct(logits.view(-1), labels.view(-1))
    else:
        loss_fct = CrossEntropyLoss()
        loss = loss_fct(logits.view(-1, self.num_labels), labels.view(-1))

Karthik12 · October 9, 2020, 9:50am

Thanks @BramVanroy, this makes the code easier to read. I will do so, in future.

rgwatwormhill · October 9, 2020, 1:06pm

Thank you !

theudster · February 24, 2021, 10:49am

Can you add parameters to the loss function through transformer? for example, add weights to each of the classes?

rgwatwormhill · February 24, 2021, 7:25pm

Hi theudster,

the pytorch docs for CrossEntropyLoss suggest that you can add a weight tensor CrossEntropyLoss — PyTorch 1.7.1 documentation . What happens if you try it?

theudster · February 25, 2021, 1:25pm

I haven’t tried that because I am trying to implement everything through the Trainer method

Topic		Replies	Views
Class weights for bertForSequenceClassification Beginners	10	12688	May 29, 2022
How do I do multi Class (multi head) classification? 🤗Transformers	6	4416	October 18, 2022
BertForSequenceClassification only seems to have linear activation at the end - is this a bug? 🤗Transformers	1	2894	September 30, 2020
Weighed Loss Function in Regression Task Intermediate	1	626	April 6, 2024
TypeError: cross_entropy_loss(): argument 'input' (position 1) must be Tensor, not SequenceClassifierOutput 🤗Transformers	2	6879	April 26, 2022

Which loss function in bertforsequenceclassification regression

Related topics