I am creating training inputs from data like this:
[
{ "role": "user", "content": "prompt 1" },
{ "role": "assistant", "content": "response 1" },
{ "role": "user", "content": "prompt 2" },
{ "role": "assistant", "content": "response 2" }
]
which will be converted to something like this for mistral’s instruct syntax:
<s>[INST] prompt 1 [/INST] response 1</s> [INST] prompt 2 [/INST] response 2</s>
My question is, can Mistral-7B-Instruct-v0.2 still learn from data such as the below, which ends with a user message instead of an assistant?
[
{ "role": "user", "content": "prompt 1" },
{ "role": "assistant", "content": "response 1" },
{ "role": "user", "content": "prompt 2" }
]
or…
<s>[INST] prompt 1 [/INST] response 1</s> [INST] prompt 2 [/INST]
Could it still be useful to include the extra user message so the AI could learn from the token sequences in the final user prompt? Or is it a bad idea to include an instruction without a corresponding response?
Thanks a lot for any insight you have into this.