I am training a BERT-based model with AutoModelForSequenceClassification. I want to use class weights in the loss function, so I followed this discussion and implemented BCEWithLogitsLoss.
The problem is that HuggingFace advises using num_labels=2 for binary classification with the Auto class. When I try to use BCEWithLogitsLoss, I run into this error:
ValueError: Target size (torch.Size([32])) must be the same as input size (torch.Size([32, 2]))
I understand why this happens: b_labels holds the ground-truth labels with shape [32], while the logits have shape [32, 2]. I just can't figure out how to fix it. Here's the code:
criterion = nn.BCEWithLogitsLoss(weight=class_weights, reduction='mean')
logits = outputs.logits
loss = criterion(logits, b_labels)
Here, class_weights is tensor([1.0712, 0.9377])
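For reference, here is a minimal standalone reproduction of the mismatch. The model and data loader from my training loop are replaced with fake tensors of the same shapes (batch size 32, num_labels=2); only the shapes matter for triggering the error:

```python
import torch
import torch.nn as nn

# Fake a batch with the shapes from my training loop:
# the model's logits are [batch_size, num_labels] = [32, 2],
# the labels are a flat tensor of 0s and 1s with shape [32].
logits = torch.randn(32, 2)
b_labels = torch.randint(0, 2, (32,)).float()

class_weights = torch.tensor([1.0712, 0.9377])
criterion = nn.BCEWithLogitsLoss(weight=class_weights, reduction='mean')

try:
    loss = criterion(logits, b_labels)
except ValueError as e:
    # Target size (torch.Size([32])) must be the same as
    # input size (torch.Size([32, 2]))
    print(e)
```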
Should I just use CrossEntropyLoss instead? My dataset is imbalanced, which is why I want to switch to a custom weighted loss and see what happens. Please guide me toward a solution.