BertForTokenClassification Classifying [PAD] tokens

gdms · August 13, 2021, 10:00am

Hello,
I implemented the W-NUT Emerging Entities Example and used the model/tokenizer for new sentences. The example has less tokens (31) than the maximum used to train (86). My input id’s was correct (pad tokens with 0) and attention mask has only 1 for the 31 first tokens. When I investigate the results I noticed that pad tokens was classified too, not onlty with O but with another type of classifications.
This behavior is correct:? We can avoid this situation? Or we have to truncate the results from the label or sentence original size?

Topic		Replies	Views
Is the attention mask and tokenization taken into account? Beginners	0	349	December 7, 2021
Apply BertForTokenClassification on partially labeled input 🤗Transformers	0	260	November 16, 2021
BERT for NER output of only '0' Beginners	0	671	November 14, 2021
Token Classification with WNUT17 Beginners	2	595	December 10, 2020
BERT Model predicting 'PAD' for NER Beginners	0	597	November 11, 2021

BertForTokenClassification Classifying [PAD] tokens

Related topics