Huggingface transformers classification using num_labels 1 vs 2

nitempe · April 6, 2022, 2:01pm

question 1)

For a binary classification problem I could use num_labels as 1 (positive or not) or 2 (positive and negative). Is there any guideline regarding which setting is better? It seems that if we use 1 then probability would be calculated using sigmoid function and if we use 2 then probabilities would be calculated using softmax function.

question 2)
In both cases are my y labels going to be same? each data point will have 0 or 1 and not one hot encoding? For example, if I have 2 data points then y would be 0,1 and not [0,0],[0,1]

I have very unbalanced classification problem where class 1 is present only 2% of times. In my training data I am oversampling

bryanzhou008 · August 19, 2022, 5:02pm

For Question 1 I am wondering the same thing. Although the docs all use num_labels=2 for binary classification, I am currently using num_labels=1. Hope to know if there are any differences.

Topic		Replies	Views
Correct numeric labels for classification? 🤗Transformers	1	349	September 29, 2021
Dino2 for classification has wrong number of labels 🤗Transformers	2	466	October 2, 2023
Multiclass Classification: "labels" format Beginners	0	670	October 26, 2022
The Best Approach for Weighted Multilabel Classification 🤗Transformers	1	70	January 24, 2025
Multi-class classification with Multi-hot encoded vector Beginners	2	1383	February 26, 2023

Huggingface transformers classification using num_labels 1 vs 2

Related topics