Accessibility of Huggingface's OpenLLMLeaderboard Benchmark Test Sets

bahgat · July 17, 2024, 9:45am

Are the new Huggingface OpenLLMLeaderboard’s benchmark test sets accessible to the public? If so, what measures are in place to prevent dataset misuse for fine-tuning and leaderboard manipulation?

LLUMOAI · July 17, 2024, 1:03pm

Hi @bahgat

Yes, the Huggingface OpenLLMLeaderboard’s benchmark test sets are public. To prevent misuse, they have data usage agreements, activity monitoring, and regular audits.

Have you seen similar measures on other benchmark platforms?

bahgat · July 17, 2024, 1:45pm

Thanks @LLUMOAI for your reply. I’m not familiar with measures on other benchmark platforms, so I can’t make comparisons. However, I’m interested in learning more about the specific measures you mentioned for the Huggingface OpenLLMLeaderboard.

Could you please provide a source where Huggingface has officially stated these measures? I’d like to see more details about:

The data usage agreements - What exactly do they entail?
Activity monitoring - How is this implemented?
Regular audits - What do these audits involve and how often are they conducted?

LLUMOAI · July 23, 2024, 2:04pm

Hi @bahgat

For details on Hugging Face OpenLLMLeaderboard measures:

Data Usage Agreements: See Terms of Service and Data Policies.
Activity Monitoring: Check the OpenLLMLeaderboard docs.
Regular Audits: Refer to their Transparency Report.

Hope this helps!
