Hi all,
I already trained an NER (token classification) model on a custom dataset with 19 classes. You can explore it here: marefa-nlp/marefa-ner. The base model I used for fine-tuning was xlm-roberta-large.
I now have a new dataset and need to use the previously trained model, marefa-nlp/marefa-ner, as the base model this time.
The problem is that the previous model was trained to predict one of 19 classes, while the new dataset has only 6. I tried loading the model and resetting its configuration to the xlm-roberta-large configuration like this:
from transformers import AutoModelForTokenClassification, AutoConfig
base_model = "xlm-roberta-large"
ft_model = "marefa-nlp/marefa-ner"
config = AutoConfig.from_pretrained(base_model)
ner_model = AutoModelForTokenClassification.from_pretrained(ft_model, num_labels=19)
ner_model.config = config
# THEN using the ner_model to train with the new dataset
but this doesn't seem to work: swapping the config object doesn't resize the already-built classification head, so the model still expects 19 labels.
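One thing I came across is the ignore_mismatched_sizes argument of from_pretrained, which (if I understand correctly) discards the checkpoint weights whose shapes don't match and randomly initializes a fresh head. I'm not sure this is the intended way, but a minimal sketch would look like this (using a tiny randomly initialized XLM-R config instead of the real marefa-nlp/marefa-ner checkpoint, just to keep the example small):

```python
import tempfile

from transformers import (
    AutoModelForTokenClassification,
    XLMRobertaConfig,
    XLMRobertaForTokenClassification,
)

# Tiny stand-in for the fine-tuned 19-class checkpoint
# (in practice this would be "marefa-nlp/marefa-ner").
config_19 = XLMRobertaConfig(
    vocab_size=100,
    hidden_size=32,
    num_hidden_layers=1,
    num_attention_heads=2,
    intermediate_size=64,
    num_labels=19,
)
model_19 = XLMRobertaForTokenClassification(config_19)

with tempfile.TemporaryDirectory() as ckpt_dir:
    model_19.save_pretrained(ckpt_dir)

    # Reload with a 6-label head. ignore_mismatched_sizes=True drops the
    # saved 19-way classifier weights and initializes a new 6-way head,
    # while keeping all the encoder weights from the checkpoint.
    model_6 = AutoModelForTokenClassification.from_pretrained(
        ckpt_dir,
        num_labels=6,
        ignore_mismatched_sizes=True,
    )

print(model_6.classifier.out_features)
```

With the real checkpoint, the call would just be `AutoModelForTokenClassification.from_pretrained("marefa-nlp/marefa-ner", num_labels=6, ignore_mismatched_sizes=True)`, and then fine-tuning on the new dataset would train the fresh 6-way head.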
Does anyone know how to solve this?
Thanks