Instruction tuning a pre-trained base model

how to pass instruction, context and answer in the training dataset?
I am trying to instruction tune phi3.5 but unsure of how to pass above in training dataset.

My use-case is to create a planner agent for a chatbot which does the task decomposition.
I generated the training dataset using gpt-4o and with a refined prompt and manually correction in the output wherever required.
I maintain the entity context, current user input and last bot reply which was passed to the prompt along with instructions to generate the desired output.

1 Like