How do I change the classification head of a model?

Hi @oliverguhr, which solution worked for you for binary classification?

Thank you! This fixed my problem too!

It was weird that this didn’t work:
AutoModelForSequenceClassification.from_pretrained("huggingface/CodeBERTa-language-id", num_labels=15)

but this did:

config = AutoConfig.from_pretrained("huggingface/CodeBERTa-language-id")
config.num_labels = 15
model = AutoModelForSequenceClassification.from_config(config)

In reply to tolgayan: the point is that this gets trained; you're fine-tuning the model.
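For anyone wondering what that training step looks like in practice, here is a minimal sketch using the Trainer API; the dataset variables (train_ds, eval_ds) and the hyperparameters are placeholders I am assuming, not something from the original post:

from transformers import TrainingArguments, Trainer

# `model` is the re-headed model built above (num_labels=15); train_ds/eval_ds
# are assumed to be tokenized datasets with an integer "labels" column (0..14).
training_args = TrainingArguments(
    output_dir="codeberta-15-labels",
    num_train_epochs=3,
    per_device_train_batch_size=16,
)

trainer = Trainer(
    model=model,
    args=training_args,
    train_dataset=train_ds,
    eval_dataset=eval_ds,
)
trainer.train()  # the randomly initialized classification head gets updated here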


I think this is the simplest and most intuitive one. Why did nobody like this?


I find the solution by @nielsr, i.e. adding the parameter ignore_mismatched_sizes, the most elegant and simple one. It also makes clear what happens within the code.


Hi @carlosaguayo,

Initializing a model from a config will randomly initialize all the weights of the model. To use the pre-trained weights and add a new, randomly initialized head on top, you would need to do:

from transformers import AutoModelForSequenceClassification

model = AutoModelForSequenceClassification.from_pretrained("huggingface/CodeBERTa-language-id", num_labels=15, ignore_mismatched_sizes=True)

Simple but best solution, it solves everything.

I used the 'label_names' argument on my Trainer to define which labels I wanted instead of the default 'labels' choice.

On the Trainer, I set 'num_labels=6' and 'ignore_mismatched_sizes=True' appropriately; however, when running trainer.train() I get the following error:

TypeError: forward() got an unexpected keyword argument 'cohesion'
(My 6 labels are ['cohesion', 'syntax', 'vocabulary', 'phraseology', 'grammar', 'conventions'])

How would I fix this? Thanks in advance!

EDIT: I fixed this by passing a label matrix as the labels column instead of using label_names in the training args (see the sketch below), but if someone knows how to properly use that Trainer argument I'd appreciate it.
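For reference, here is a minimal sketch of that workaround, assuming a Hugging Face Datasets dataset with one float column per target; the dataset variable and the preprocessing details are my assumptions, not from the original post:

target_cols = ["cohesion", "syntax", "vocabulary",
               "phraseology", "grammar", "conventions"]

def merge_targets(example):
    # Collapse the six score columns into a single "labels" vector so that
    # forward() receives labels=... instead of cohesion=..., syntax=..., etc.
    example["labels"] = [float(example[col]) for col in target_cols]
    return example

dataset = dataset.map(merge_targets, remove_columns=target_cols)

Depending on whether you treat these as regression targets or multi-label targets, you may also need to set problem_type on the model config so the right loss is used.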

In fact, you can customize the pre-trained model by changing its layers. For instance, I use BertForSequenceClassification for a classification task.

from copy import deepcopy
import torch
import torch.nn as nn
from transformers import BertTokenizer, AutoModelForSequenceClassification

device = torch.device("cuda" if torch.cuda.is_available() else "cpu")

tokenizer = BertTokenizer.from_pretrained('bert-base-uncased')

model = AutoModelForSequenceClassification.from_pretrained('bert-base-uncased')
model.to(device)

However, if I want to change its classification head, I would do this:

cp_model = deepcopy(model)
# Replace the default single-layer classifier with a deeper head.
cp_model.classifier = nn.Sequential(
    nn.Linear(768, 526),
    nn.ReLU(),
    nn.Dropout(0.1),
    nn.Linear(526, 258),
    nn.ReLU(),
    nn.Dropout(0.1),
    nn.Linear(258, 2),
    nn.Softmax(dim=-1)
)
cp_model.to(device)

In fact, you can do this directly on the model, but I make a copy because I do not want to change anything on the base model (it's just personal preference).
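As a quick sanity check, a forward pass with the modified head could look like this (the example sentence is made up). One caveat: the built-in loss of BertForSequenceClassification applies cross-entropy to raw logits, so with a Softmax at the end of the head you would typically either drop the Softmax or compute the loss yourself.

inputs = tokenizer("This movie was great!", return_tensors="pt").to(device)

with torch.no_grad():
    outputs = cp_model(**inputs)

probs = outputs.logits  # already probabilities here, since the head ends in Softmax
print(probs.shape)      # torch.Size([1, 2])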

Hello @nielsr

I hope you are well. I am fine-tuning GPT-Neo and, to overcome overfitting, I want to increase the dropout to 0.2. If I do this with the command below, can I use the model for fine-tuning directly with my own dataset?

from transformers import AutoTokenizer, AutoModelForCausalLM

tokenizer = AutoTokenizer.from_pretrained("gpt-neo")

# Override the dropout values in the config when loading the pre-trained weights.
model = AutoModelForCausalLM.from_pretrained("gpt-neo", embed_dropout=0.2, resid_dropout=0.2, attention_dropout=0.2)