I would like to fine-tune a T5 model for sequence classification (specifically sentiment classification). However, all the tutorials I have found cover seq2seq tasks such as text summarization, like the one below. I understand why it uses the ROUGE score as the evaluation metric and the AutoModelForSeq2SeqLM class, since summarization is a seq2seq task. I think I need to change my loss/metric and also replace AutoModelForSeq2SeqLM with something else? Any suggestions/ideas on how to proceed?
You can check out this notebook to fine-tune T5 on sequence classification (not with a classification head the way encoder-only models do, but with conditional generation). In case you haven't already, I'd suggest you try encoder-only models with classification heads and see how they perform. It's a simpler approach that tends to work better for classification, IMHO.
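To make the conditional-generation approach concrete, here is a minimal sketch of how it can be set up with the transformers library. The idea is to turn each class label into a short text target (e.g. "positive"/"negative") and train T5 with its usual token-level cross-entropy loss; at inference time you generate text and map it back to a class id. The label strings, the "sst2 sentence:" task prefix, and the helper function names are my own illustrative choices, not from the notebook:

```python
# Sketch: sentiment classification framed as text-to-text for T5.
# The label wording and task prefix here are assumptions, not a fixed API.

LABEL2TEXT = {0: "negative", 1: "positive"}
TEXT2LABEL = {v: k for k, v in LABEL2TEXT.items()}

def make_example(text, label):
    """Build a (source, target) text pair for T5 from a labeled example."""
    return {
        # T5 conventionally gets a task prefix on the input.
        "input_text": "sst2 sentence: " + text,
        # The target is the label verbalized as text, not a class index.
        "target_text": LABEL2TEXT[label],
    }

def decode_prediction(generated_text):
    """Map generated text back to a class id; unknown strings become -1."""
    return TEXT2LABEL.get(generated_text.strip().lower(), -1)

def loss_and_prediction(text, label, model_name="t5-small"):
    """Illustrative training/inference step (requires downloading a checkpoint)."""
    from transformers import AutoTokenizer, T5ForConditionalGeneration

    tokenizer = AutoTokenizer.from_pretrained(model_name)
    model = T5ForConditionalGeneration.from_pretrained(model_name)

    ex = make_example(text, label)
    enc = tokenizer(ex["input_text"], return_tensors="pt")
    labels = tokenizer(ex["target_text"], return_tensors="pt").input_ids

    # Passing `labels` makes the model return the standard seq2seq
    # cross-entropy loss, so no custom cost function is needed.
    loss = model(**enc, labels=labels).loss

    out = model.generate(**enc, max_new_tokens=3)
    pred = decode_prediction(tokenizer.decode(out[0], skip_special_tokens=True))
    return loss, pred
```

With this framing you keep AutoModelForSeq2SeqLM (or T5ForConditionalGeneration) as-is; only the targets change from summaries to label words, and for evaluation you would compute accuracy on the decoded predictions instead of ROUGE.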
Thanks @merve, I have tried BERT as an encoder-only model, and the accuracy is fine. However, I want to see whether I can do better with a fine-tuned T5.
@dammy how did your experiment with T5 fine-tuning on sentiment classification go?