T5 sequence classification

manzar · December 7, 2020, 3:11pm

I would like to do sequence classification over the encoder of the T5 model.

Which hidden state of the last layer should I use for the classification? Should I use the hidden state of the last timestep, or take the mean over all timesteps?

Thank you in advance.

arunwzd · May 8, 2022, 6:53am

Are there any suggestions for this? I am not sure whether hidden[:, 0, :] makes sense (since T5 has no [CLS] token), but I found that hidden[:, 0, :] yields better results than torch.mean(hidden_states, dim=1). Any suggestions on the best way to do this in T5Encoder?
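For comparing the two pooling options mentioned above, here is a minimal sketch, not a definitive recipe. It assumes `hidden` is the encoder's last-layer output of shape (batch, seq_len, d_model), for example `last_hidden_state` from `T5EncoderModel` in transformers, and that `attention_mask` comes from the tokenizer. The helper names `first_token_pool` and `masked_mean_pool` are made up for illustration, and dummy tensors stand in for a real model so the snippet runs without downloading anything. One thing worth checking: a plain torch.mean(hidden_states, dim=1) also averages over padding positions, so a mask-aware mean may behave differently on padded batches.

```python
import torch


def first_token_pool(hidden):
    # Take the hidden state at position 0. T5 has no [CLS] token, so this is
    # simply the representation of the first input token.
    return hidden[:, 0, :]


def masked_mean_pool(hidden, attention_mask):
    # Average only over real tokens; padding positions (mask == 0) are ignored.
    mask = attention_mask.unsqueeze(-1).to(hidden.dtype)  # (batch, seq_len, 1)
    summed = (hidden * mask).sum(dim=1)                   # (batch, d_model)
    counts = mask.sum(dim=1).clamp(min=1.0)               # avoid division by zero
    return summed / counts


# Dummy stand-ins (assumption): in practice these would come from something like
#   outputs = T5EncoderModel.from_pretrained(...)(input_ids=..., attention_mask=...)
#   hidden = outputs.last_hidden_state
torch.manual_seed(0)
hidden = torch.randn(2, 4, 8)                  # batch=2, seq_len=4, d_model=8
attention_mask = torch.tensor([[1, 1, 1, 1],   # no padding
                               [1, 1, 0, 0]])  # last two positions are padding

pooled_first = first_token_pool(hidden)               # (2, 8)
pooled_mean = masked_mean_pool(hidden, attention_mask)  # (2, 8)
naive_mean = torch.mean(hidden, dim=1)                # includes padding positions
```

Either pooled tensor can then be fed to a linear classification head. Which one works better seems to be an empirical question, so it is probably worth trying both (with the mask-aware mean) on a validation set.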
