Causal LLM benchmarks

Currently I’m trying to get the LM evaluation harness running, without success. I was curious whether there is an easy way to benchmark or evaluate pre-trained generative text models inside the Hugging Face library; a rough sketch of what I mean is below. I’m sorry if this is really obvious.
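
To make the question concrete, here is a minimal sketch of the kind of evaluation I have in mind: computing perplexity for a single string with transformers. The model name "gpt2" and the example sentence are just placeholders, not my actual setup.

```python
# Minimal perplexity check for a causal LM with transformers.
# "gpt2" and the example sentence are placeholders.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_name = "gpt2"
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForCausalLM.from_pretrained(model_name)
model.eval()

text = "The quick brown fox jumps over the lazy dog."
inputs = tokenizer(text, return_tensors="pt")

with torch.no_grad():
    # Passing the input ids as labels makes the model return the
    # (shifted) cross-entropy loss over the sequence.
    outputs = model(**inputs, labels=inputs["input_ids"])

perplexity = torch.exp(outputs.loss)
print(f"perplexity: {perplexity.item():.2f}")
```

Ideally I’d like something along these lines, but run over a standard benchmark dataset rather than a single string, if such a thing already exists in the Hugging Face ecosystem.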