SequenceClassification num_labels<2 doesn't work on trainer

bhavnicksm · December 28, 2021, 2:25pm

I was trying out running AutoModelForSequenceClassification for num_labels = 1 and for num_labels =2 but no matter what changes I did, the Trainer module kept throwing some or the other unfixable error.

In num_labels=1, it worked but the model learnt absolutely nothing with the regression loss. I think there should be a BinaryCrossEntropyLoss there instead of the MSELoss because it just doesn’t do a good job in training the model.

For num_labels=2, unless you turn on the multi-label-problem=True, it doesn’t work because the trainer keeps asking for targets of size [*, 2] without expanding the labels into one-hot.

Now, this could be a bug or there might be a specific way to make it work.BCEwithLogitsLoss is a much better option than MSELoss and I can’t think of the reason why someone would use MSELoss.

forwins · May 14, 2024, 6:30am

Hi there. MSELoss is actually not bad for binary classification, because it used sigmoid function and sum up to 1, but if u gonna use model with multi-label-problem u have to choose BCEwithLogitsLoss because it uses the one hot encoding with softmax and logits therefore whatever how many categories u want to predict it will sum up to 1 and works well

Topic		Replies	Views
Mullti Label Text Classification 🤗Transformers	2	1569	June 26, 2023
Custom BCEWithLogitsLoss for Sequence Classification using Auto Model Beginners	1	19	May 26, 2025
XLNetForSequenceClassification 🤗Transformers	27	1214	January 16, 2021
Multi label classification with large number of labels and sparse data 🤗Transformers	1	1523	July 15, 2023
Multi-label token classification 🤗Transformers	34	7679	September 6, 2023

SequenceClassification num_labels<2 doesn't work on trainer

Related topics