Instruction tuning a pre-trained base model

anishkhandelwal06 · December 18, 2024, 10:15am

how to pass instruction, context and answer in the training dataset?
I am trying to instruction tune phi3.5 but unsure of how to pass above in training dataset.

My use-case is to create a planner agent for a chatbot which does the task decomposition.
I generated the training dataset using gpt-4o and with a refined prompt and manually correction in the output wherever required.
I maintain the entity context, current user input and last bot reply which was passed to the prompt along with instructions to generate the desired output.

Topic		Replies	Views
A criticism of instruction fine-tuning datasets Research	2	2100	June 20, 2023
Problems with understanding instruction fine-tuning Beginners	0	453	April 2, 2024
Domain adaptation fine tune VS instruction_tuned 🤗Transformers	2	3132	January 21, 2024
Fine-Tuning + RAG based Chatbot: Dataset Structure & Instruction Adherence Issues Intermediate	7	385	March 11, 2025
Using same instructions for fine-tuning: Is this bad for the model? Intermediate	1	459	March 26, 2024

Instruction tuning a pre-trained base model

Related topics