Feedback on LLM-as-a-Judge design for open-source library

stefanwebb · August 9, 2025, 7:32pm

Hey Folks, I wanted to share a recently release feature of Oumi for LLM-as-a-Judge. Would love to get your feedback on how we can improve the API, documentation, additional features you’d like to see, and so on - we’re a community driven project after all!

Here’s the docs: LLM Judge — Oumi as well as a blog post: “ OpenAI just dropped two massive open-weight models — *but how do we separate the reality from the hype?* ” showing how I used it to evaluate gpt-oss-120b and gpt-oss-20b

Topic	Replies	Views
Open API standard for open-source LLMs Research	910	July 1, 2023
CFP of NLP-OSS @ EMNLP 2023 ((apologies for cross-posting) ---------------------------------------------------------------- Workshop for NLP Open Source Software (NLP-OSS) 06 Dec 2023, Co-located with EMNLP 2023 (https://nlposs.github.io/) Community Calls	774	June 9, 2023
Open Source LLMs: Your experience and recommendation Beginners	996	November 17, 2023
Safe_Mode = [True, False] Community Calls	259	December 27, 2023
[CFP] 3rd Workshop for NLP Open Source Software at EMNLP 2023 Community Calls	460	July 18, 2023

Feedback on LLM-as-a-Judge design for open-source library

Related topics