I am working on a project for text classification. I started out with BERT and AutoModelForSequenceClassification, and now I want to move up the food chain and try some larger models. To do that on a single A100, I was hoping to use PEFT, LoRA, and bitsandbytes or accelerate, for example. But the examples I have found all use AutoModelForCausalLM.
I tried to adapt one of the tutorials to use BloomForSequenceClassification, but the PEFT tutorial says: "Finally, we need to apply some post-processing on the 8-bit model to enable training, let's freeze all our layers, and cast the layer-norm in float32 for stability. We also cast the output of the last layer in float32 for the same reasons." and gives this code:
model.lm_head = CastOutputToFloat(model.lm_head)
That is fine for AutoModelForCausalLM, but it does not work for BloomForSequenceClassification, because that model does not have an lm_head layer.
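For what it's worth, here is my (untested) guess at how that cast would adapt. I'm assuming Bloom's sequence-classification head is the `score` attribute, which is what I see in the transformers source, standing in for the tutorial's `lm_head`:

```python
import torch
import torch.nn as nn

# From the PEFT tutorial: wrap a head module so its output is cast to float32.
class CastOutputToFloat(nn.Sequential):
    def forward(self, x):
        return super().forward(x).to(torch.float32)

# My guess at the analogous line for BloomForSequenceClassification,
# whose head attribute appears to be `score` rather than `lm_head`:
# model.score = CastOutputToFloat(model.score)
```

The freezing and layer-norm casting steps from the tutorial don't reference `lm_head`, so presumably only this one line needs changing.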
Which leads me to ask: can one use PEFT and LoRA with an AutoModelForSequenceClassification?
Alternatively, can one use an AutoModelForCausalLM for text classification?
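On that second question, the only approach I can think of is prompting: compare the model's next-token logits for one verbalizer word per class. A rough sketch of what I mean (the model name, prompt, and labels here are just placeholders I made up):

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

def classify(text, labels, model_name="gpt2"):
    """Pick the label whose first token gets the highest next-token logit."""
    tok = AutoTokenizer.from_pretrained(model_name)
    model = AutoModelForCausalLM.from_pretrained(model_name)
    prompt = f"Review: {text}\nSentiment:"
    ids = tok(prompt, return_tensors="pt").input_ids
    with torch.no_grad():
        logits = model(ids).logits[0, -1]  # distribution over the next token
    # Score each label by the logit of its first token (with a leading space).
    scores = {lab: logits[tok(" " + lab).input_ids[0]].item() for lab in labels}
    return max(scores, key=scores.get)
```

But that feels like a workaround compared to a proper classification head, so I'd still prefer the sequence-classification route if it is supported.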