Centralized Benchmarks

denyslinkov · May 23, 2022, 1:46pm

Are there any plans to centralize benchmarks for datasets or models? There are some links to papers and code for datasets and also a sentence transformers benchmark

New models are added often which is great, but it’s hard to track how they perform since each paper often cherry picks comparative examples.

lhoestq · May 24, 2022, 12:51pm

cc @lewtun this sounds related to auto-evaluation ?

lewtun · May 24, 2022, 1:03pm

Thanks for the ping @lhoestq !

Yes @denyslinkov we’re currently working on tooling that should make it much easier to run large-scale evaluations & model comparisons across the Hub. Stay tuned

Topic		Replies	Views
Discovering best models for a task Models	2	440	January 26, 2025
Where is the source to benchmark's dataset entries on the model's website Beginners	2	395	August 10, 2020
Image Dataset Benchmarking 🤗Datasets	0	30	February 5, 2025
Dataset not available in Hub Evaluator 🤗Datasets	0	227	February 21, 2023
How to Evaluate Fine-Tuned LLMs? Models	3	55	November 27, 2025

Centralized Benchmarks

Related topics