How to fine-tune/instruction-tune a large language model on a QA corpus?

I would like to fine-tune a large language model that was pretrained with causal language modelling. The fine-tuning will be done on a question-answering (or instruction-tuning) corpus, so I don’t want to train the model to complete the question/instruction, but only to generate the answer. How can I achieve this with the Hugging Face Transformers package?
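My current guess is that the prompt tokens should be masked out in the labels with -100, which the cross-entropy loss ignores, so that only the answer tokens contribute to the loss. Is something like the following sketch the right approach? (The question/answer pair is just a placeholder; in practice it would come from the corpus.)

```python
import torch
from transformers import GPT2LMHeadModel, GPT2TokenizerFast

tokenizer = GPT2TokenizerFast.from_pretrained("gpt2")
model = GPT2LMHeadModel.from_pretrained("gpt2")

# Placeholder example pair; real data would come from the QA corpus.
question = "Question: What is the capital of France?\nAnswer:"
answer = " Paris"

# Tokenize prompt and answer separately so we know where the answer starts.
prompt_ids = tokenizer(question)["input_ids"]
answer_ids = tokenizer(answer + tokenizer.eos_token)["input_ids"]

input_ids = torch.tensor([prompt_ids + answer_ids])
# -100 is the ignore_index of the loss, so the prompt positions contribute
# nothing: the model is only trained to generate the answer tokens.
labels = torch.tensor([[-100] * len(prompt_ids) + answer_ids])

loss = model(input_ids=input_ids, labels=labels).loss
loss.backward()
```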

If I load the pretrained LLM with something like GPT2ForQuestionAnswering, it warns: “Some weights of GPT2ForQuestionAnswering were not initialized from the model checkpoint at gpt2 and are newly initialized: [‘qa_outputs.weight’, ‘qa_outputs.bias’]”. But what I actually want is to keep the exact same causal-language-modelling parameters of the LLM rather than replacing the output layer.
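As far as I can tell from the docs, GPT2ForQuestionAnswering adds an extractive span-prediction head (SQuAD-style), which is why qa_outputs is newly initialized. For generating answers, I assume the checkpoint should instead be loaded with its original LM head, e.g.:

```python
from transformers import AutoModelForCausalLM

# Loads every pretrained weight, including the LM head (which is tied to
# the input embeddings in GPT-2), so nothing is newly initialized.
model = AutoModelForCausalLM.from_pretrained("gpt2")
```

Is that the correct way to keep all pretrained parameters, combined with the label masking above?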