Question about the VideoMAE loss function for multi-label fine-tuning

In modeling_videomae.py, the loss is computed as follows:

loss = None
if labels is not None:
    if self.config.problem_type is None:
        if self.num_labels == 1:
            self.config.problem_type = "regression"
        elif self.num_labels > 1 and (labels.dtype == torch.long or labels.dtype == torch.int):
            self.config.problem_type = "single_label_classification"
        else:
            self.config.problem_type = "multi_label_classification"

    if self.config.problem_type == "regression":
        loss_fct = MSELoss()
        if self.num_labels == 1:
            loss = loss_fct(logits.squeeze(), labels.squeeze())
        else:
            loss = loss_fct(logits, labels)
    elif self.config.problem_type == "single_label_classification":
        loss_fct = CrossEntropyLoss()
        loss = loss_fct(logits.view(-1, self.num_labels), labels.view(-1))
    elif self.config.problem_type == "multi_label_classification":
        loss_fct = BCEWithLogitsLoss()
        loss = loss_fct(logits, labels)
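To double-check which branch my labels hit, I reproduced the dtype-based dispatch outside the model (a minimal standalone sketch; num_labels and the tensor mirror my setup):

import torch

num_labels = 3
# Multi-hot labels as they come out of my dataset
# (torch.tensor of Python ints defaults to int64)
labels = torch.tensor([[0, 1, 1],
                       [1, 0, 0]])

# Mirrors the problem_type inference above
if num_labels == 1:
    problem_type = "regression"
elif num_labels > 1 and (labels.dtype == torch.long or labels.dtype == torch.int):
    problem_type = "single_label_classification"
else:
    problem_type = "multi_label_classification"

print(labels.dtype)   # torch.int64, which equals torch.long
print(problem_type)   # single_label_classification, not what I want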

I don’t get it. I’m trying to solve a multilabel classification problem, and my labels are:

tensor([[0, 1, 1],
        [1, 0, 0]], device='cuda:0')

So because my labels' dtype is int (torch.int64), the problem type gets set to "single_label_classification" instead of "multi_label_classification", and then CrossEntropyLoss throws a dimension mismatch exception.
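The exception makes sense once the single-label branch fires: CrossEntropyLoss expects class indices of shape (batch_size,), not multi-hot rows. A minimal repro with made-up logits:

import torch
from torch.nn import CrossEntropyLoss

logits = torch.randn(2, 3)            # (batch_size, num_labels)
labels = torch.tensor([[0, 1, 1],
                       [1, 0, 0]])    # multi-hot, dtype torch.int64

loss_fct = CrossEntropyLoss()
# view(-1, 3) keeps logits at (2, 3), but view(-1) flattens labels to (6,)
loss = loss_fct(logits.view(-1, 3), labels.view(-1))  # raises: batch sizes 2 vs 6 don't match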

Is this correct? Shouldn't multi-label classification also accept int labels? Should I change my labels' dtype to float?
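For what it's worth, here are the two workarounds I'm considering (a sketch; I'm not sure which is the intended approach):

# Option 1: cast the labels to float so the dtype check falls through to
# the multi-label branch (BCEWithLogitsLoss expects float targets anyway)
labels = labels.float()

# Option 2: set the problem type explicitly, so the dtype-based
# inference (which only runs when problem_type is None) is skipped
model.config.problem_type = "multi_label_classification"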

Thanks!