I want to use `AutoModelForSequenceClassification` with a LLaMA 7B model.
How will the input flow through the model if I load it with this class?
Yes, it works; I used GPT2-XL for NLU tasks.
But it also depends on the weights you use, and on whether transformers provides a class called `transformers.XXXForSequenceClassification`, where XXX is the model class name.
For example: https://github.com/huggingface/transformers/blob/v4.33.2/src/transformers/models/gpt2/modeling_gpt2.py#L1376
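To sketch the input flow for a decoder-only model like LLaMA: as far as I understand, the `...ForSequenceClassification` classes run the backbone to get one hidden state per token, then (since causal models have no `[CLS]` token) pool the hidden state of the *last non-padding token*, located via `pad_token_id`, and pass it through a linear `score` head to produce `num_labels` logits. Below is a minimal pure-Python illustration of that pooling + head logic; the shapes, names (`last_non_pad_index`, `classify`), and toy numbers are my own assumptions for illustration, not the library's actual code.

```python
# Illustration (assumed simplification) of how a sequence-classification
# head sits on top of a causal LM such as LLaMA: pick the hidden state of
# the last non-padding token, then apply a linear score head.

def last_non_pad_index(input_ids, pad_token_id):
    """Index of the last non-padding token in one sequence."""
    idx = len(input_ids) - 1
    while idx > 0 and input_ids[idx] == pad_token_id:
        idx -= 1
    return idx

def classify(hidden_states, input_ids, score_weights, pad_token_id):
    """hidden_states: seq_len x hidden_dim; score_weights: num_labels x hidden_dim.
    Returns num_labels logits, mirroring the last-token pooling idea."""
    pooled = hidden_states[last_non_pad_index(input_ids, pad_token_id)]
    return [sum(w * h for w, h in zip(row, pooled)) for row in score_weights]

# Toy example: 4 tokens with trailing padding (pad_token_id=0)
input_ids = [5, 9, 7, 0]
hidden = [[1.0, 0.0], [0.0, 1.0], [2.0, 3.0], [9.0, 9.0]]  # last row is padding
weights = [[1.0, 0.0], [0.0, 1.0]]  # 2 labels, hidden dim 2
print(classify(hidden, input_ids, weights, pad_token_id=0))  # → [2.0, 3.0]
```

Note the padding row is ignored: the logits come from the hidden state at index 2, the last real token. In practice you would just load the model with `AutoModelForSequenceClassification.from_pretrained(..., num_labels=N)` and set `model.config.pad_token_id`, since LLaMA tokenizers often have no pad token by default.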
Hi, did you try this? Does it give better results compared to the BERT models? I am trying the same, but the model is overfitting on the training dataset.