I want to edit the Llama 3.1 model code locally and then run benchmarks against my modified version.
I have seen the lm-evaluation-harness library. It looks easy to run a benchmark on a model and dataset hosted on Hugging Face, but I'm not sure whether it's flexible enough to work with model code I've edited locally.
Could someone show me how?
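To make the question concrete, here's a rough sketch of what I'm hoping is possible, using the harness's Python API to wrap a model I've loaded myself. The path is a placeholder, and I haven't verified this end to end:

```python
# Rough sketch of what I'm hoping is possible -- untested, and the
# path is a placeholder for wherever my edited Llama 3.1 lives.
from transformers import AutoModelForCausalLM, AutoTokenizer

import lm_eval
from lm_eval.models.huggingface import HFLM

# Load the model through my locally modified code / checkpoint
model = AutoModelForCausalLM.from_pretrained("/path/to/my-edited-llama-3.1")
tokenizer = AutoTokenizer.from_pretrained("/path/to/my-edited-llama-3.1")

# Wrap it so the harness can drive it like any Hugging Face model
lm = HFLM(pretrained=model, tokenizer=tokenizer, batch_size=8)

# Run a benchmark task against the wrapped model
results = lm_eval.simple_evaluate(model=lm, tasks=["hellaswag"])
print(results["results"])
```

If that's not the right approach, a CLI invocation would also work for me, e.g. something like `lm_eval --model hf --model_args pretrained=/path/to/my-edited-llama-3.1 --tasks hellaswag`, but I don't know whether that would pick up locally edited modeling code or only local weights.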