Hi, I’m fine-tuning Mistral 7B (QLoRA) for a very specialized task with the following structure:
- a paragraph-long prompt
- a two-paragraph assistant response
- another one-sentence prompt
- a structured JSON assistant response

The prompt portions will always be exactly the same across all examples.
Training is converging very quickly (500–600 steps with batch size 1, out of 5,000 training rows total), and I’m worried that’s because it’s overfitting on the fixed prompt portions.
My question is: should I
- write a custom loss function that ignores or down-weights the prompt tokens?
- do away with prompting altogether and hope the model learns the task organically?
- something else?
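For reference, here’s a rough sketch of what I mean by the first option: masking the fixed prompt tokens out of the loss by setting their labels to -100 (the `ignore_index` that PyTorch’s `CrossEntropyLoss` and Hugging Face causal-LM models use by default). The token IDs below are dummies; in practice they’d come from the Mistral tokenizer.

```python
# Sketch of option 1: only assistant tokens contribute to the loss.
# -100 is the default ignore_index for torch.nn.CrossEntropyLoss,
# and HF causal-LM models pass labels through to it unchanged.
IGNORE_INDEX = -100

def build_labels(prompt_ids, response_ids):
    """Labels for a single prompt+response pair: prompt positions are
    masked out, response positions are trained on normally."""
    return [IGNORE_INDEX] * len(prompt_ids) + list(response_ids)

def build_multiturn_labels(turns):
    """turns: ordered list of (token_ids, is_assistant) segments.
    Only assistant segments keep their token IDs as labels; the fixed
    prompt segments are masked so they add nothing to the loss."""
    labels = []
    for ids, is_assistant in turns:
        labels += list(ids) if is_assistant else [IGNORE_INDEX] * len(ids)
    return labels
```

For my two-turn structure, that would mean masking both prompt segments and training only on the two assistant responses, e.g. `build_multiturn_labels([(p1, False), (r1, True), (p2, False), (r2, True)])`. Down-weighting instead of fully ignoring would need a custom loss with per-token weights rather than the -100 trick.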
Thanks!!