Correct numeric labels for classification?

olaffson · September 15, 2021, 5:50pm

Hello,

This is a simple question but better safe than sorry! My understanding is that the transformers class of models (for text classification) can only deal with integer labels as classes.

So it’s up to the user to provide a mapping between labels and scores. In the usual example one could have 0 = negative, 1 = neutral, 2 = positive.
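A minimal sketch of such a mapping in plain Python, assuming the three class names from the example above. The `label2id` and `id2label` names are chosen here for illustration, and the `raw` list is made-up data:

```python
# Class names from the example above; the order fixes the integer ids.
labels = ["negative", "neutral", "positive"]

# Forward and reverse lookups between class names and integer ids.
label2id = {name: idx for idx, name in enumerate(labels)}
id2label = {idx: name for name, idx in label2id.items()}

# Hypothetical raw annotations, encoded as integers for training.
raw = ["positive", "negative", "neutral", "positive"]
encoded = [label2id[name] for name in raw]

print(encoded)       # [2, 0, 1, 2]
print(id2label[1])   # neutral
```

Keeping the reverse dictionary around makes it easy to turn a predicted id back into a readable class name.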

Here is the basic question: do the numeric labels necessarily need to be integers from 0 to N-1 (where N is the number of classes), or can I use any other numbers of my liking?

Thanks!

olaffson · September 29, 2021, 6:16pm

Yes, I can confirm the labels have to be integers starting at zero. I still wonder what the mathematical reason for that is. Any ideas, @nielsr, by any chance?

Thanks!
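One likely explanation, offered as a sketch rather than a description of the library internals: a classification head outputs one logit per class, and the cross-entropy loss uses the integer label as an index into that vector of logits. With N classes only the positions 0 to N-1 exist, so any other value has nothing to point at. The pure-Python function below is a hypothetical stand-in for a framework loss, written only to show the indexing step:

```python
import math

def cross_entropy(logits, label):
    """Cross-entropy for one example: -log softmax(logits)[label]."""
    # The label is used as a position in the logits list, so it must
    # be an integer in the range 0 .. len(logits) - 1.
    if not isinstance(label, int) or not 0 <= label < len(logits):
        raise IndexError(f"label {label!r} is not a valid class index")
    # Log-sum-exp with the maximum subtracted for numerical stability.
    m = max(logits)
    log_z = m + math.log(sum(math.exp(x - m) for x in logits))
    return log_z - logits[label]

logits = [0.5, 2.0, -1.0]            # made-up scores for 3 classes
loss_ok = cross_entropy(logits, 1)   # fine: 1 is a valid index

try:
    cross_entropy(logits, 3)         # 3 is out of range for 3 classes
    out_of_range_rejected = False
except IndexError:
    out_of_range_rejected = True
```

Under this view the integers carry no numeric meaning. They are just positions, which is why arbitrary values such as 10, 20, 30 need to be remapped to 0, 1, 2 first.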
