Is it possible to finetune *ForQA models with SFT (PEFT/QLoRA)?

allantourin · January 7, 2024, 1:19pm

Hi, I am trying to follow the finetuning tutorial for question answering here. So far, all I have managed to do is load the model in 4-bit using the bnb_config. I’m trying out Flan T5:

from transformers import AutoModelForQuestionAnswering, AutoTokenizer, TrainingArguments, BitsAndBytesConfig
from datasets import load_dataset
from peft import LoraConfig
from trl import SFTTrainer
import torch

bnb_config = BitsAndBytesConfig(
    load_in_4bit=True,
    bnb_4bit_compute_dtype='bfloat16',
    bnb_4bit_quant_type='nf4',
    bnb_4bit_use_double_quant=True
)
training_args = TrainingArguments(
    output_dir="my_awesome_qa_model",
    evaluation_strategy="epoch",
    learning_rate=2e-5,
    logging_steps=50,
    per_device_train_batch_size=16,
    per_device_eval_batch_size=16,
    num_train_epochs=3,
    max_steps=1,
    weight_decay=0.01,
)
model = AutoModelForQuestionAnswering.from_pretrained(
    "google/flan-t5-base",
    quantization_config=bnb_config,
    device_map={"": 0},
    low_cpu_mem_usage=True,
    torch_dtype=torch.bfloat16,
    return_dict=True,
    use_cache=True
)
tokenizer = AutoTokenizer.from_pretrained("google/flan-t5-base")

I tried to run SFTTrainer:

dataset = load_dataset('json', data_files=DATA_PATH, split='train')
peft_config = LoraConfig(
    r=64,
    lora_alpha=16,
    lora_dropout=0.05,
    target_modules=['q', 'k', 'v', 'o', 'wi_0', 'wi_1'],
    bias="none",
    task_type="QUESTION_ANS",
)
trainer = SFTTrainer(
    model=model,
    train_dataset=dataset,
    eval_dataset=dataset,
    peft_config=peft_config,
    dataset_text_field='prompt',
    max_seq_length=128,
    tokenizer=tokenizer,
    args=training_args,
    packing=False,
)
trainer.train()

and it led to this error:

TypeError: T5ForQuestionAnswering.forward() got an unexpected keyword argument 'labels'

My question is: can *ForQuestionAnswering models be finetuned using PEFT/QLoRA with SFTTrainer? I read in a GitHub issue that SFTTrainer is only for language modeling. My thinking is that question answering is a language modeling task, so it should be possible. Is that correct? If so, how do I finetune *ForQA models with SFTTrainer?

nielsr · January 7, 2024, 9:17pm

Hi,

It’s possible to fine-tune xxxForQuestionAnswering models using PEFT/QLoRA, but not with the SFTTrainer class, which is only meant for models with a language modeling head on top, like xxxForCausalLM in the Transformers library.

For xxxForQuestionAnswering models, you can train the model with the regular Trainer class instead. These models treat the task as an extractive one: they predict the start and end of the answer within a given piece of context. They are not generative models, which is why their forward method does not accept a labels argument and you get the TypeError above. Instead, you need to prepare labels that indicate where the answer starts and ends in each context, and pass them as start_positions and end_positions.
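
To make the label preparation concrete, here is a minimal sketch of how a character-level answer span can be mapped to token indices. It is an illustration under stated assumptions, not the tutorial's exact code: the helper name find_answer_token_span is hypothetical, and the offsets and context mask are hand-written toy values. In practice they would typically come from a fast tokenizer called with return_offsets_mapping=True and from the encoding's sequence_ids().

```python
from typing import List, Tuple


def find_answer_token_span(
    offsets: List[Tuple[int, int]],
    context_mask: List[bool],
    answer_start: int,
    answer_text: str,
) -> Tuple[int, int]:
    """Map a character-level answer span to (start, end) token indices.

    offsets[i] is the (char_start, char_end) of token i within its own text;
    context_mask[i] is True when token i belongs to the context rather than
    the question. Returns (0, 0) when the answer is not fully inside the
    (possibly truncated) context.
    NOTE: hypothetical helper, written for illustration only.
    """
    answer_end = answer_start + len(answer_text)

    context_idx = [i for i, is_ctx in enumerate(context_mask) if is_ctx]
    if not context_idx:
        return 0, 0
    first, last = context_idx[0], context_idx[-1]

    # The answer must lie entirely within the tokens we actually kept.
    if offsets[first][0] > answer_start or offsets[last][1] < answer_end:
        return 0, 0

    # Walk forward to the last context token that starts at or before the answer.
    start = first
    while start <= last and offsets[start][0] <= answer_start:
        start += 1
    start -= 1

    # Walk backward to the first context token that ends at or after the answer.
    end = last
    while end >= first and offsets[end][1] >= answer_end:
        end -= 1
    end += 1

    return start, end


# Toy example: two question tokens followed by the context
# "The capital of France is Paris", split on whitespace.
offsets = [(0, 5), (6, 8), (0, 3), (4, 11), (12, 14), (15, 21), (22, 24), (25, 30)]
context_mask = [False, False, True, True, True, True, True, True]

# "Paris" starts at character 25 of the context.
print(find_answer_token_span(offsets, context_mask, 25, "Paris"))  # (7, 7)
```

The two returned indices would then be stored per example as start_positions and end_positions, which is what the xxxForQuestionAnswering forward method expects in place of labels.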

