How does an NER model learn from the way labels are processed during training?

Hi guys, I am just getting started with NER using transformers, but I am running into trouble understanding how an NER model is trained and how it is expected to behave during inference.
During training, tokenization and labeling are done so that, for each word, the label is assigned only to the first sub-token, and -100 (the ignore_index) is assigned to all the other sub-tokens of that word. But during inference, the model is expected to predict the same label for all sub-tokens of a word.
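To make the question concrete, here is a minimal sketch of the alignment step I am describing (a toy example, not the exact script; `words`, `ner_labels`, and `label_map` are made up, and I am assuming `bert-base-cased` just for illustration):

```python
from transformers import AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("bert-base-cased")

# Made-up inputs purely for illustration.
words = ["Hugging", "Face", "is", "in", "Brooklyn"]
ner_labels = ["B-ORG", "I-ORG", "O", "O", "B-LOC"]
label_map = {"B-ORG": 0, "I-ORG": 1, "O": 2, "B-LOC": 3}
pad_token_label_id = -100  # the default ignore_index of nn.CrossEntropyLoss

tokens, label_ids = [], []
for word, label in zip(words, ner_labels):
    word_tokens = tokenizer.tokenize(word)
    tokens.extend(word_tokens)
    # Real label on the first sub-token, -100 on all remaining
    # sub-tokens of the word, mirroring the linked utils_ner.py.
    label_ids.extend(
        [label_map[label]] + [pad_token_label_id] * (len(word_tokens) - 1)
    )

print(list(zip(tokens, label_ids)))
# A word split into several sub-tokens, e.g. "Hugging" -> ["Hu", "##gging"],
# would come out as something like ('Hu', 0), ('##gging', -100).
```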

Training: https://github.com/huggingface/transformers/blob/6b4c617666fd26646d44d54f0c45dfe1332b12ca/examples/token-classification/utils_ner.py#L110-L117
Inference: https://huggingface.co/transformers/usage.html#named-entity-recognition
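My understanding so far is that the -100 positions simply drop out of the loss, so the model gets no gradient signal for them. Here is a plain PyTorch sketch of that (logits and labels are made-up values; `ignore_index` defaults to -100 in `nn.CrossEntropyLoss`):

```python
import torch
import torch.nn as nn

loss_fct = nn.CrossEntropyLoss()  # ignore_index defaults to -100

# Fake logits for 4 token positions over 3 labels; values are random.
logits = torch.randn(4, 3)
labels = torch.tensor([0, -100, 2, 1])

# The position labeled -100 is masked out of the loss entirely,
# so the model is never told what to predict there.
loss = loss_fct(logits, labels)
print(loss)
```

If that is right, then at inference the model still emits a prediction for every sub-token, even though it was never supervised on the non-first ones, which is exactly the part I find confusing.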

I am just trying to understand how the model learns under this scheme. Thanks :v:

Great ecosystem BTW :slight_smile:

ping @stefan-it @vblagoje