How do I finetune Llama-3-8B to predict a float value?

eiontey · September 23, 2024, 11:21am

Hi, my task is to train a model on a statement with a masked token in which I have samples for with a “rating” value from -1 to 1 on how good the sample answer is for the statement. I want to finetune Llama-3-8B to predict the values when given a statement and a sample answer for it but since I am trying to predict a float value, will I have to change anything for the model? All the examples I’ve seen so far are completion tasks and outputting strings. Should I just instruct it to rate it from a value from -1 to 1 as part of the instructions or should I change the last layer to regression but how would I do that?

Topic		Replies	Views
Performance problems with finetuned model (Llama 2 7B based) Beginners	3	687	June 10, 2024
Finetuning 4bit model Beginners	1	2428	August 29, 2023
Bad Performance Finetuning Llama Chat and Instruct Models on GSM8K Beginners	5	1114	December 5, 2024
How to Load Llama-3.3-70B-Instruct Model in Float8 Precision? 🤗Transformers	1	291	December 11, 2024
Reduced inference f1 score with QLoRA finetuned model Intermediate	1	881	September 6, 2023

How do I finetune Llama-3-8B to predict a float value?

Related topics