Metrics for Text Generation from T5 Model

Praneet · April 23, 2023, 6:17pm

Hey guys, I was training a T5 model and noticed that one of the metrics used for evaluation is the Exact Match metric. Is there any other metric that I could possibly use for evaluating text generation from the T5 model? If yes, could you also point me toward resources that would help me implement such metrics?

Chrode · October 13, 2023, 10:42am

hey @Praneet did you solve it? I am looking for the same approach. thanks

Praneet · October 14, 2023, 10:27pm

Sadly, I never really got around to it. I see many people just running against popular benchmarks but that won’t work for my task. So I usually create a small test set with 30 to 50 samples that I can run my LLM over and manually evaluate. I heard from a few people behind some of the popular LLMs doing something similar for smaller tasks that don’t have popular ways of evaluating them.

@Chrode

braintrustdata · November 1, 2023, 10:38pm

Hey Praneet,

Braintrust is a great tool for running those evaluations on the 30 to 50 samples. We provide a Python/Typescript library to run and log those evals and give you a web UI to visualize improvements/regressions/etc.

Use it for free @ https://braintrustdata.com/

Topic		Replies	Views
Keyword generation using T5 Models	4	1982	November 2, 2022
T5 Model Evaluation on Generation 🤗Transformers	0	421	February 8, 2024
T5 Fine Tuning - Text to Text Generation 🤗Transformers	2	1285	April 7, 2021
T5 finetuning metrics not improving 🤗Transformers	1	341	June 20, 2023
Creating t5 for language Beginners	0	238	April 9, 2022

Metrics for Text Generation from T5 Model

Related topics