Finetuning GPT for a classification task fails

Has anyone tried finetuning GPT for classification tasks (don't ask why GPT, I'm testing a hypothesis)?
I took GPT2 Finetune Classification by George Mihaila as a base and built my binary classifier from GPT Neo 128. Unfortunately, after 5 epochs the F1 score dropped from 30% to zero: the logits from the classification head now always predict class zero. What might be going wrong, or is GPT just not suited for classification?
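For context, here is a minimal sketch of the kind of setup I mean (pure PyTorch with a toy backbone standing in for GPT Neo; all names are illustrative). The relevant detail is that GPT-style sequence classification pools the hidden state of the last non-padding token, since causal models have no `[CLS]` token:

```python
import torch
import torch.nn as nn

class GPTStyleClassifier(nn.Module):
    """Toy stand-in for a GPT backbone plus a binary classification head."""
    def __init__(self, vocab_size=100, hidden=32, num_labels=2, pad_id=0):
        super().__init__()
        self.pad_id = pad_id
        self.embed = nn.Embedding(vocab_size, hidden)
        # Stand-in for the actual transformer stack.
        self.backbone = nn.TransformerEncoderLayer(hidden, nhead=4, batch_first=True)
        self.score = nn.Linear(hidden, num_labels)  # classification head

    def forward(self, input_ids):
        h = self.backbone(self.embed(input_ids))
        # Pool the hidden state of the LAST non-padding token in each sequence.
        last_idx = (input_ids != self.pad_id).sum(dim=1) - 1
        pooled = h[torch.arange(h.size(0)), last_idx]
        return self.score(pooled)  # logits, shape (batch, num_labels)

model = GPTStyleClassifier()
ids = torch.tensor([[5, 7, 9, 0, 0], [3, 4, 0, 0, 0]])  # 0 = padding
logits = model(ids)
print(logits.shape)  # torch.Size([2, 2])
```

If the pad token index is wrong here, every example gets pooled from a padding position, which is one way the head can collapse to a single class.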