Help with preparing training data for fine-tuning a Llama 3.1 Instruct model?

Hi everyone,

I just started learning and want to fine-tune the Llama 3.1 70B-Instruct model.
I have the retrieved context and the user query, along with a prompt telling the LLM to answer the query using the context.

  1. Is this the right template for preparing a query-response pair?
    ```
    <|begin_of_text|><|start_header_id|>user<|end_header_id|>
    Answer the user query using the provided context.
    Query:
    {query}
    Context:
    {context}
    Context ends here.
    <|eot_id|><|start_header_id|>assistant<|end_header_id|>
    {response}<|eot_id|>
    ```
  2. In which format should I combine all the query-response pairs into a single training file? (A rough sketch of what I'm currently thinking is below.)
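
To make my question concrete, here's roughly how I was planning to assemble everything: fill the template from point 1 for each pair and write one JSON object per line (JSONL). The file name, field name (`"text"`), and placeholder data are just assumptions on my part; I'm not sure this is what the trainer actually expects.

```python
import json

# Llama 3.1 chat template from point 1 above
LLAMA31_TEMPLATE = (
    "<|begin_of_text|><|start_header_id|>user<|end_header_id|>\n"
    "Answer the user query using the provided context.\n"
    "Query:\n{query}\n"
    "Context:\n{context}\n"
    "Context ends here.\n"
    "<|eot_id|><|start_header_id|>assistant<|end_header_id|>\n"
    "{response}<|eot_id|>"
)

def build_example(query: str, context: str, response: str) -> dict:
    # Fill the template for a single query-response pair;
    # "text" is just the field name I assumed, not necessarily what the trainer wants
    return {"text": LLAMA31_TEMPLATE.format(query=query, context=context, response=response)}

# Placeholder data only, to show the structure
pairs = [
    {"query": "What is X?", "context": "X is a thing that ...", "response": "X is ..."},
]

# One JSON object per line (JSONL)
with open("train.jsonl", "w", encoding="utf-8") as f:
    for p in pairs:
        f.write(json.dumps(build_example(p["query"], p["context"], p["response"]), ensure_ascii=False) + "\n")
```

Is something like this reasonable, or is there a different standard format I should be using?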

Thanks a lot for the help!
