Evluation Metric for LLM output generation

316usman · November 27, 2023, 6:06pm

I have fine-tuned a model for replying in a specific tone (example pirate tone or on the data of a specfic brands marketing material). How do i check / evaluate if my model is following the intended tone, Is there any evaluation metric (like rouge, perplexity) that can be calculated to check for this type of use case Please help.

Topic		Replies	Views
Applying an evaluation metric for causal LM model Beginners	0	590	October 18, 2023
Metrics for Text Generation from T5 Model Beginners	3	872	November 1, 2023
Evaluate fine-tuned LLM for question answering Beginners	1	51	May 2, 2025
Evaluation metrics for BERT-like LMs Research	4	4615	December 6, 2024
How can I evaluate a fine tuned LLM? Intermediate	4	1026	January 7, 2025

Evluation Metric for LLM output generation

Related topics