Well - this is connected to this question: BertForSequenceClassification only seems to have a linear activation at the end - is this a bug?
Why is the activation only handled inside the loss function? IMO, the different classification setups need different last-layer activations: binary classification needs a sigmoid, single-label multi-class needs a softmax, and multi-label classification needs a sigmoid again (applied per label). But the model always seems to end with a linear (i.e. no) activation. @sgugger
Isn’t this a bug?
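
To illustrate what I mean, here's a minimal sketch (assuming the standard `transformers` API and an arbitrary `num_labels=3`): the model returns raw logits, so at inference time I have to apply the softmax or sigmoid myself; during training the activation only happens implicitly inside `nn.CrossEntropyLoss` / `nn.BCEWithLogitsLoss`.

```python
import torch
from transformers import AutoTokenizer, BertForSequenceClassification

tokenizer = AutoTokenizer.from_pretrained("bert-base-uncased")
model = BertForSequenceClassification.from_pretrained(
    "bert-base-uncased", num_labels=3  # num_labels=3 is just for illustration
)
model.eval()

inputs = tokenizer("An example sentence.", return_tensors="pt")
with torch.no_grad():
    # shape (1, num_labels); raw logits, no activation applied by the model
    logits = model(**inputs).logits

# single-label multi-class: softmax over the label dimension
probs_multiclass = torch.softmax(logits, dim=-1)

# multi-label (or binary): sigmoid applied per label
probs_multilabel = torch.sigmoid(logits)
```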