My list of human preference datasets

gabeliu · May 10, 2023, 9:10pm

Hey all - I’ve been spending some time learning about RLHF and instruction tuning. I put together a list of human preference datasets and wanted to share it with this group. It’s at: GitHub - glgh/awesome-llm-human-preference-datasets: A curated list of Human Preference Datasets for LLM fine-tuning, RLHF, and eval.. Give me a star if this seems useful!

I’m curious for those of you who use such datasets, how well do you find the open source options are serving your needs. Do any of you also use commercial services like Scale or MTurk? What’s one thing you would do to make such data more useful?

(P.S. if this is a topic you find interesting, feel free to grab some time with me using Calendly - G Liu)

Topic		Replies	Views
Meta Persona an abstract adaptive neural construct Research	0	729	November 25, 2020
A criticism of instruction fine-tuning datasets Research	2	2120	June 20, 2023
[Open-to-the-community] One week team-effort to reach v2.0 of HF datasets library 🤗Datasets	292	13941	October 30, 2022
Train GPT2/3 on social media posts and comments (reddit/Facebook etc) Flax/JAX Projects	4	454	June 29, 2021
Request for Further Information on Datasets Beginners	0	284	November 26, 2020

My list of human preference datasets

Related topics