AutoTrain ORPO dataset format

Hi,

I'm wondering what the correct format is for the "prompt" field for ORPO on AutoTrain. In the example dataset used (distilabel-capybara-dpo-7k-binarized), the prompts have already been formatted in ChatML. Does that mean I should format my prompts the same way? And if I don't want to use ChatML, should I format the prompt in whatever template I do want?
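To make the question concrete, here is a sketch of the two prompt styles being contrasted. This is just an illustration of what "raw" vs. ChatML-formatted text looks like, not a statement of what AutoTrain requires (that is exactly what's being asked); the example prompt string is made up.

```python
# Raw prompt: just the plain instruction text.
raw_prompt = "What is the capital of France?"

# The same prompt wrapped in ChatML turn markers, as seen in the
# distilabel-capybara-dpo-7k-binarized dataset.
chatml_prompt = (
    "<|im_start|>user\n"
    "What is the capital of France?<|im_end|>\n"
    "<|im_start|>assistant\n"
)

print(raw_prompt)
print(chatml_prompt)
```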

Also, the documentation for ORPO states that the only columns needed for the Reward/ORPO trainer are the text and rejected text columns, which is a bit confusing, because I get an error if I don't supply the "Prompt" field.

Hi. It should be the same as DPO. I'll fix the docs ASAP.
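A DPO-style preference row, as referenced in the reply above, might look like the following sketch. The column names (`prompt`, `chosen`, `rejected`) and the example strings are assumptions based on common DPO datasets; the actual column names expected by AutoTrain should be checked in its column-mapping settings.

```python
import json

# Minimal sketch of one DPO-style preference example:
# a shared prompt plus a preferred and a rejected completion.
example_row = {
    "prompt": "Explain what ORPO training does.",
    "chosen": "ORPO optimizes the model to prefer the chosen response...",
    "rejected": "I don't know.",
}

# Datasets in this shape are often stored as JSON Lines, one row per line.
print(json.dumps(example_row))
```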

Thanks! And re: the prompt format, should it be the raw text, or formatted with ChatML (or another chat template)?