ByT5 for text classification

janyfe · July 30, 2021, 10:09am

Hi!

I’m facing a problem while trying to fine-tune the ByT5 model for a text classification task.

I’ve tried to use the t5_fine_tuning.ipynb notebook from patil-suraj/exploring-T5 on GitHub (by @valhalla) to fine-tune ByT5 for text classification. When I start from ‘google/byt5-small’ I get really strange results: the model always generates the negative sentiment label (‘n’ in my code).

When I switch to the pretrained t5-small checkpoint, the results are reasonable.

What am I missing here? Any piece of advice would be much appreciated!

Chandrai · June 20, 2022, 5:37am

Hello @janyfe. Did you find a solution for this? I am facing the same issue with a multi-class text classification task: among 6 classes, ByT5 always predicts a single class.

janyfe · June 20, 2022, 11:56am

Hello @Chandrai! No, unfortunately, I wasn’t able to find a solution and gave up on using the ByT5 model.

mohanadhafez · July 21, 2024, 7:27am

Hello @Chandrai. Did you find a solution?
