In most sentiment analysis tasks implemented with BERT, I see that only the embedding of [CLS] is passed to the classifier, while the other token embeddings go unused. What is the reason behind this?
According to the paper, BERT's [CLS] token aggregates the hidden states of the other tokens, which renders them "useless" for sequence classification tasks, since all the relevant information is already pooled into [CLS].
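As a minimal sketch of what this looks like in practice (using the Hugging Face `transformers` library; the `bert-base-uncased` checkpoint and the two-label linear head are just illustrative choices), the classifier only ever sees the hidden state at position 0, which is where the [CLS] token sits:

```python
import torch
from transformers import BertModel, BertTokenizer

# Illustrative checkpoint; any BERT variant exposes the same outputs.
tokenizer = BertTokenizer.from_pretrained("bert-base-uncased")
model = BertModel.from_pretrained("bert-base-uncased")

# Hypothetical classification head: 2 labels for binary sentiment.
classifier = torch.nn.Linear(model.config.hidden_size, 2)

inputs = tokenizer("This movie was great!", return_tensors="pt")
with torch.no_grad():
    outputs = model(**inputs)

# last_hidden_state has shape (batch, seq_len, hidden_size);
# index 0 along the sequence dimension is the [CLS] token.
cls_embedding = outputs.last_hidden_state[:, 0, :]

logits = classifier(cls_embedding)  # shape (batch, 2)
print(logits.softmax(dim=-1))
```

For reference, `BertModel` also returns `outputs.pooler_output`, which is this same [CLS] hidden state passed through an extra dense layer and a tanh, and that pooled vector is what `BertForSequenceClassification` feeds to its classification head.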
Could you please elaborate on that? I see that the transformers/BERT layers include Token Embeddings, Segment Embeddings, and Position Embeddings, along with the [CLS] and [SEP] tokens.