Fine-tuning LLM for RAG

Chahnwoo · June 10, 2024, 12:33am

I am currently fine-tuning a LLM with a custom QA dataset, and was wondering whether there would be a significant difference whether I ran QLoRA fine-tuning with a model initialized with AutoModelForCausalLM and a model initialized with AutoModelForQuestionAnswering. If there does exist a significant difference, which of the two is preferrable?

For additional context, the dataset I will be fine-tuning with consists of three columns: question, context, and answer. The three columns are then formatted with a prompt along the lines of

### Instruction
Use the context below to generate an answer to the provided question. If the context does not contain sufficient information, state that an answer could not be found.

### Context
{context}

### Question
{question}

### Response
{response}

nielsr · June 10, 2024, 7:14am

Hi,

The difference is generative vs. extractive question answering.

The AutoModelForQuestionAnswering class is meant for extractive question answering, i.e. the model is a classifier and needs to determine which text token of the context is at the start of the answer and which part of the context is at the end.

The AutoModelForCausalLM classs is meant for generative question answering, i.e. the model is a generative model (like ChatGPT) and can generate any text given the context + question. This means that the model can give answers that go beyond the literate text of the context.

system · June 18, 2024, 12:08am

This topic was automatically closed 12 hours after the last reply. New replies are no longer allowed.

Topic		Replies	Views
Model Tuning and Re-Tuning Problems Models	2	34	June 10, 2025
Fine tuning using llm Qlora Beginners	0	904	March 20, 2024
Help, please! Seems fine tuning on LLM is not working Beginners	4	1535	April 5, 2024
Fine-tuning with Different Model Heads Intermediate	4	766	April 30, 2024
What is precisely happening during LLM fine-tuning with autoTrain? 🤗AutoTrain	1	629	October 27, 2023

Fine-tuning LLM for RAG

Related topics