Instruction fine-tuning in plain PyTorch

Hi everyone!

I would like to instruction fine-tune “Zephyr-7B” to make it hallucinate a little less (as the generative part of a RAG pipeline), i.e. so that it answers with “I don’t know” when no context is given, instead of making things up.

However, I would like to do the fine-tuning in a plain PyTorch loop. Would I then prepare my data the same way as for causal language modeling?
That is, my input would be my prompt (context, question and answer) and my labels would be identical to my input (transformers then automatically shifts the labels by one token).
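To make the question more concrete, this is roughly what I have in mind — just a sketch, not working training code: `samples` is a placeholder for my context/question/answer data, and I'm assuming the zephyr-7b-beta checkpoint with its chat template:

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_name = "HuggingFaceH4/zephyr-7b-beta"  # assuming the beta checkpoint
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForCausalLM.from_pretrained(model_name, torch_dtype=torch.bfloat16)
model.train()

def build_example(sample):
    # sample is a placeholder dict with "context", "question", "answer"
    messages = [
        {"role": "user", "content": f"Context:\n{sample['context']}\n\nQuestion: {sample['question']}"},
        {"role": "assistant", "content": sample["answer"]},
    ]
    text = tokenizer.apply_chat_template(messages, tokenize=False)
    enc = tokenizer(text, truncation=True, max_length=1024, return_tensors="pt")
    enc["labels"] = enc["input_ids"].clone()  # labels = inputs; the model shifts them internally
    return enc

optimizer = torch.optim.AdamW(model.parameters(), lr=2e-5)

for sample in samples:  # batch size 1 just to keep the sketch short
    batch = {k: v.to(model.device) for k, v in build_example(sample).items()}
    out = model(**batch)   # loss is computed against the (shifted) labels
    out.loss.backward()
    optimizer.step()
    optimizer.zero_grad()
```

Is that the right way to think about it, or is there more to it than setting labels equal to the input ids?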

In short, is instruction fine-tuning the same as causal language modeling, except that I train my model to generate text that follows my instruction template?

Thanks in advance for any help!

Kind regards
Christopher

EDIT: I think I probably found the solution here.