Multi-class classification with Multi-hot encoded vector

Rebecka · February 17, 2023, 1:35pm

Hi, I am very new to the Hugging Face community.

In my Sentiment Analysis training set I have a multi-hot encoded vector for the labels, where each 1 marks the presence of a label: [1 0 0 0 0 0 0 0 0 0 0 0 0 1 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0].

I am trying to follow some of the tutorials provided by Hugging Face, but as far as I can tell none of them covers my problem.

# train_ds:
Dataset({
    features: ['work_id', 'labels', 'text'],
    num_rows: 307102
})

# train_ds.features:
{'work_id': Value(dtype='string', id=None),
 'labels': Sequence(feature=ClassLabel(names=['pornographic-content', 'violence', 'death', 'sexual-assault', 'abuse', 'blood', 'suicide', 'pregnancy', 'child-abuse', 'incest', 'underage', 'homophobia', 'self-harm', 'dying', 'kidnapping', 'mental-illness', 'dissection', 'eating-disorders', 'abduction', 'body-hatred', 'childbirth', 'racism', 'sexism', 'miscarriages', 'transphobia', 'abortion', 'fat-phobia', 'animal-death', 'ableism', 'classism', 'misogyny', 'animal-cruelty'], id=None), length=-1, id=None),
 'text': Value(dtype='string', id=None)}

Could someone help me? I am not really sure how to proceed.

Best thanks,
Rebecka

ChanceChallacombe · February 26, 2023, 6:50am

I have a similar issue. Did you figure it out?

ChanceChallacombe · February 26, 2023, 6:51am

There are options for binary multi-label problems, but not for multi-class tasks with multi-hot encoded labels.
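For reference, a label vector with several 1s at once is usually handled as a multi-label problem: each of the 32 labels gets its own independent yes/no decision, scored with binary cross-entropy. The sketch below is plain Python with no Hugging Face dependency and only illustrates the idea. The helpers `to_multi_hot`, `decode` and `bce_with_logits` are hypothetical names, and the indices 0 and 13 are chosen for illustration. As far as I know, loading a sequence classification model in Transformers with `num_labels=32` and `problem_type="multi_label_classification"` applies this kind of loss and expects the labels as float vectors of length 32, so a conversion like `to_multi_hot` could be applied to the `labels` column first if it holds class indices.

```python
import math

# Label names copied from the ClassLabel feature shown in the thread.
LABEL_NAMES = [
    "pornographic-content", "violence", "death", "sexual-assault", "abuse",
    "blood", "suicide", "pregnancy", "child-abuse", "incest", "underage",
    "homophobia", "self-harm", "dying", "kidnapping", "mental-illness",
    "dissection", "eating-disorders", "abduction", "body-hatred", "childbirth",
    "racism", "sexism", "miscarriages", "transphobia", "abortion", "fat-phobia",
    "animal-death", "ableism", "classism", "misogyny", "animal-cruelty",
]


def to_multi_hot(label_ids, num_labels=len(LABEL_NAMES)):
    """Turn a list of class indices into a float multi-hot vector."""
    vec = [0.0] * num_labels
    for i in label_ids:
        vec[i] = 1.0
    return vec


def decode(multi_hot, names=LABEL_NAMES):
    """Return the names of the labels that are switched on."""
    return [name for name, flag in zip(names, multi_hot) if flag >= 0.5]


def bce_with_logits(logits, targets):
    """Mean binary cross-entropy, treating every label independently."""
    total = 0.0
    for z, y in zip(logits, targets):
        # Numerically stable form: max(z, 0) - z*y + log(1 + exp(-|z|))
        total += max(z, 0.0) - z * y + math.log1p(math.exp(-abs(z)))
    return total / len(targets)


# Illustrative example: two labels active at once (indices 0 and 13).
labels = to_multi_hot([0, 13])
print(decode(labels))  # ['pornographic-content', 'dying']

# An untrained model that outputs logit 0 for every label.
print(bce_with_logits([0.0] * len(LABEL_NAMES), labels))  # about 0.693 (ln 2)
```

Floats are used instead of integers because binary cross-entropy implementations generally expect floating-point targets. At prediction time, each logit would be passed through a sigmoid and thresholded (0.5 is a common but arbitrary choice) to recover a multi-hot vector.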
