distilbert-base-multilingual-cased

Hello

I am running distilbert-base-multilingual-cased on PyTorch. My model has 4 classes in the target.

In the code in models/distilbert/modeling_distilbert.py, I am reaching this branch:

```python
elif self.config.problem_type == "multi_label_classification":
    loss_fct = BCEWithLogitsLoss()
    loss = loss_fct(logits, labels)
```

I have two questions:

1. BCEWithLogitsLoss must receive the labels as one-hot vectors rather than integers. Must I take care of the one-hot encoding myself?
2. If I wish to add regularization terms to the loss, what is the best practice for doing so?

Thanks

Note that multi_label_classification is only for problems where one example can have multiple labels, so you should use the default if your samples can only have one label.
If you are dealing with a true multi-label problem, then it’s very likely your labels are already in a one-hot format.
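
If you did need to do the conversion yourself, a minimal sketch using torch.nn.functional.one_hot (assuming the 4 classes from the question) would look like this:

```python
import torch
import torch.nn.functional as F

# Integer class indices, one label per example (single-label style).
labels = torch.tensor([0, 2, 3, 1])

# Convert to the float one-hot format BCEWithLogitsLoss expects;
# num_classes=4 matches the 4 target classes from the question.
one_hot = F.one_hot(labels, num_classes=4).float()
print(one_hot.shape)  # torch.Size([4, 4])
```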

For your second question, you should just output the logits of your model and then compute the loss manually with your penalty added. If you’re using the Trainer API, you can subclass it and override compute_loss to do that; see here for an example.
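
As an illustration only, a minimal sketch of that pattern might look like the following; the L2 penalty on the classifier head and the l2_lambda weight are made-up choices for the example, not a recommendation from this thread:

```python
import torch
from torch.nn import CrossEntropyLoss
from transformers import Trainer

class RegularizedTrainer(Trainer):
    """Trainer subclass that adds a custom penalty term to the loss."""

    def compute_loss(self, model, inputs, return_outputs=False, **kwargs):
        labels = inputs.pop("labels")
        outputs = model(**inputs)
        logits = outputs.logits

        # Standard single-label loss (4 classes in this thread).
        loss_fct = CrossEntropyLoss()
        loss = loss_fct(logits.view(-1, model.config.num_labels), labels.view(-1))

        # Hypothetical regularizer: L2 penalty on the classifier head,
        # weighted by a made-up hyperparameter l2_lambda.
        l2_lambda = 1e-4
        penalty = sum(
            p.pow(2).sum()
            for name, p in model.named_parameters()
            if "classifier" in name
        )
        loss = loss + l2_lambda * penalty

        return (loss, outputs) if return_outputs else loss
```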

Thanks for the quick answer

Q2 is pretty clear.
Q1: I realized that I was giving labels of type int64. As we begin to train, in data_loader.py we reach this block:

```python
if "label" in first and first["label"] is not None:
    label = first["label"].item() if isinstance(first["label"], torch.Tensor) else first["label"]
    dtype = torch.long if isinstance(label, int) else torch.float
```

Since isinstance(label, int) is False for an int64 (e.g., a NumPy scalar), it converted the labels to float, which routes us to multi_label_classification via the following block:

```python
elif self.num_labels > 1 and (labels.dtype == torch.long or labels.dtype == torch.int):
    self.config.problem_type = "single_label_classification"
else:
    self.config.problem_type = "multi_label_classification"
```

The corollary is that one has to give the labels as Python int (so the collator keeps torch.long) and not as int64.
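
A quick sketch of that type check, with casting to Python int as one possible workaround (the cast is my suggestion, not from the thread):

```python
import numpy as np
import torch

# NumPy int64 scalars do not subclass Python int ...
label = np.int64(2)
print(isinstance(label, int))       # False -> the collator falls back to torch.float

# ... so cast to Python int (or build a torch.long tensor) before collation.
print(isinstance(int(label), int))  # True  -> the collator keeps torch.long
labels = torch.tensor([0, 2, 3, 1], dtype=torch.long)
print(labels.dtype)                 # torch.int64 (i.e., torch.long)
```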

Thanks

Thanks
Natan