My BERT won’t predict any special tokens

I'm training a BERT model from scratch on domain-specific sequences. To prepare each input, I prepend [CLS], append [SEP] at the end, and add [PAD] tokens after [SEP] when the sequence needs padding. I trained with cross-entropy loss and stopped once accuracy reached about 90%, but the model only ever predicts non-special tokens; it never outputs [CLS], [SEP], or [PAD]. What did I do wrong? Should I add some constraint to the loss so the model has a stronger inductive bias toward producing these tokens?
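
For reference, my input preparation and loss look roughly like this (a simplified sketch; the token IDs and max length are illustrative, not my real vocabulary or config):

```python
import torch
import torch.nn.functional as F

# Illustrative special-token IDs and sequence length, not my actual vocab.
CLS_ID, SEP_ID, PAD_ID = 1, 2, 0
MAX_LEN = 128

def prepare_input(token_ids):
    """Prepend [CLS], append [SEP], then pad with [PAD] up to MAX_LEN."""
    ids = [CLS_ID] + token_ids[: MAX_LEN - 2] + [SEP_ID]
    ids += [PAD_ID] * (MAX_LEN - len(ids))
    return torch.tensor(ids)

def token_loss(logits, labels):
    """Plain cross entropy over every position.

    logits: (batch, seq_len, vocab_size)
    labels: (batch, seq_len) token IDs the model should predict
    """
    return F.cross_entropy(logits.view(-1, logits.size(-1)), labels.view(-1))
```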