Finetuning llama for classification

given131 · June 4, 2024, 2:11pm

I have some domain specific datasets, so I want to fine-tune llama or other LLMs.
Other than putting classification head on the top, I want to train the model in the generative way, letting models output text that can be thought of labels.

But the issues is that, how can I restrict model generated text to the labels that I want? I’m worried if the model would generate texts that are invalid (text that are not label)

It would be a great help!

nmcahill · June 6, 2024, 10:12pm

You could look at the probabilities on the lm_head logit associated with the first token in each of your answers to turn your lm head into a classifier. It works well if you can formulate your problem such that the answers have unique token ids try yes no answers or multiple choice a), b), c)

ayushp12 · January 21, 2025, 6:11pm

Hi, could you please elaborate more. I am fine-tuning LLAMA on a multiple-choice question-answering (MCQA) dataset. During the training phase, would it be a good approach to trim the model’s output head to just four tokens corresponding to the answer options, so that during the generation phase, the model is constrained to generate only the labels? Are there any alternative strategies I could consider for achieving this?

Topic		Replies	Views
LLAMA for MCQA dataset Beginners	0	54	January 21, 2025
Text classification and generation from the same model Beginners	1	840	July 27, 2023
Fine tune Transformers for text generation 🤗Transformers	11	12157	July 27, 2023
Multilabel classification using LLMs Beginners	12	15371	June 7, 2024
Finetuned model generating test label exactly Beginners	0	465	October 15, 2020

Finetuning llama for classification

Related topics