Fine-tuning Llama 3 with QLoRA - metrics calculation

fCola · May 31, 2024, 2:55pm

Hi everybody!

I am trying to fine-tune Llama 3 8B with PEFT and TRL. Everything seems to run correctly (at least, the loss is decreasing). I now want to compute metrics during training. To do this I need to decode the output of the LLM in a custom compute_metrics function, but I am not sure how. I have verified that I get a NumPy array with a prediction for each validation example (it seems to be 49 x vocab_size). However, if I try argmax -> decode, the output is gibberish, and I think I am missing something fundamental here.
I have seen that some libraries (LLaMA-Factory) do SFT with LoRA using a Seq2Seq trainer: would this be the correct way to go?
Thanks!
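
For reference, here is a minimal sketch of the argmax -> decode step described above. It rests on assumptions not confirmed in this thread: that the trainer passes (predictions, labels) to compute_metrics, that prompt and padding positions in the labels are set to -100, and that tokenizer.pad_token_id is set (the Llama 3 tokenizer may not define one by default). The helper names align_predictions and make_compute_metrics are hypothetical. The sketch shifts the predictions by one position, since in a causal LM the logits at position t predict the token at position t + 1, and masks out the -100 positions before decoding. Skipping either step is one possible cause of gibberish output.

```python
import numpy as np

IGNORE_INDEX = -100  # label value that the loss skips (assumed for this dataset)


def align_predictions(pred_ids, labels, pad_token_id):
    """Line up teacher-forced predictions with the labels they predict.

    pred_ids: (batch, seq_len) token ids, or (batch, seq_len, vocab) logits.
    labels:   (batch, seq_len) token ids, IGNORE_INDEX on masked positions.
    """
    pred_ids = np.asarray(pred_ids)
    labels = np.asarray(labels)
    if pred_ids.ndim == 3:
        # Raw logits: take the most likely token at every position.
        pred_ids = pred_ids.argmax(axis=-1)
    # The prediction at position t targets the label at position t + 1.
    shifted_preds = pred_ids[:, :-1]
    shifted_labels = labels[:, 1:]
    mask = shifted_labels != IGNORE_INDEX
    # -100 is not a valid token id, so swap in the pad id before decoding.
    clean_preds = np.where(mask, shifted_preds, pad_token_id)
    clean_labels = np.where(mask, shifted_labels, pad_token_id)
    return clean_preds, clean_labels, mask


def make_compute_metrics(tokenizer):
    """Build a compute_metrics function bound to a tokenizer (hypothetical helper)."""

    def compute_metrics(eval_pred):
        pred_ids, labels = eval_pred
        preds, refs, mask = align_predictions(pred_ids, labels, tokenizer.pad_token_id)
        # Token-level accuracy over the unmasked positions only.
        accuracy = float((preds[mask] == refs[mask]).mean()) if mask.any() else 0.0
        pred_texts = tokenizer.batch_decode(preds, skip_special_tokens=True)
        ref_texts = tokenizer.batch_decode(refs, skip_special_tokens=True)
        exact = float(np.mean([p.strip() == r.strip() for p, r in zip(pred_texts, ref_texts)]))
        return {"token_accuracy": accuracy, "exact_match": exact}

    return compute_metrics
```

Two caveats. These predictions are teacher-forced (each position sees the true previous tokens), so the decoded text is not what the model would produce with generate(). For generation-based metrics, a Seq2Seq-style trainer with generation enabled, or a separate generation loop, is probably the better fit. Also, collecting full logits for every validation example can use a lot of memory. If the trainer accepts a preprocess_logits_for_metrics argument, as the Transformers Trainer does, it can reduce the logits to token ids with an argmax before they are gathered.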

ngreenberg · October 17, 2024, 9:40pm

Hi, I’m having the same problem. Any luck with solving this?
