First I tried:
from transformers import AutoTokenizer, AutoConfig, AutoModelForSequenceClassification
from peft import (
    get_peft_config,
    get_peft_model,
    get_peft_model_state_dict,
    set_peft_model_state_dict,
    PeftType,
    PeftConfig,
)

checkpoint = "google/bert_uncased_L-4_H-256_A-4"
model = AutoModelForSequenceClassification.from_pretrained(
    checkpoint,
    load_in_8bit=True,
    device_map="auto",
    num_labels=len(item),
)
But that failed with an error saying BertForSequenceClassification does not support device_map.
OK, so I tried to use LoRA without bitsandbytes. But that didn't work either:
peft_config = PeftConfig(
    PeftType.LORA,
    task_type="SEQ_CLS",
    base_model_name_or_path="BertForSequenceClassification",
)
model = AutoModelForSequenceClassification.from_pretrained(
    checkpoint,
    num_labels=len(item),
)
model = get_peft_model(model, peft_config)
Now the error is that 'PeftConfig' object has no attribute 'target_modules'
Even the Colab notebook from the PEFT blog post didn't work for me, because bitsandbytes expects a different CUDA version.
Is there a place with working examples?