Fine-tune Llama2 evaluation

SuperOvni · November 27, 2023, 4:45pm

Hello,
I want to fine-tune Llama2 for a specific text generation task, I would like to get the model outputs for my evaluation dataset in text form to be able to perform custom metrics. It looks like I can use the compute_metrics function or I can use generate to get the output of my evaluation. But I’ve got a problem: when I decode my outputs, they’re absolutely not the same for the two methods, why?

Topic		Replies	Views
Custom evaluation during Llama2 fine tuning Beginners	1	1056	January 17, 2024
Fine tuning a LLaMa 3 with QLora - metrics calculation Beginners	1	885	October 17, 2024
Repetitive Token Generation During Evaluation in Fine-Tuned LLaMA Model 🤗Transformers	1	29	March 6, 2025
Selection for suitable compute metrics in SFTTrainer for QA 🤗Transformers	0	673	September 18, 2023
Llama2 finetuning for summarization mlsum 🤗Transformers	0	450	August 29, 2023

Fine-tune Llama2 evaluation

Related topics