BERT objective - does it include corrupted tokens? (not masked)

dreidizzle · January 5, 2023, 6:38pm

Hi,

I have a question about BERT’s objective. I understand that the tokens that have been replaced with [MASK] should be in the objective. However, some of the selected tokens are instead corrupted (replaced with a random token) or left unchanged. Should these not also be in the objective? If you have some data and you replace a token id with the token id for [MASK] in the input, that position effectively gets a cross-entropy loss term. But how do you specify the corrupted words or the words that stay the same? Do you need to add them to the objective manually somehow?
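To make the question concrete, here is a minimal sketch of how this is commonly set up, assuming the loss is driven by a separate `labels` sequence rather than by which input ids equal [MASK]. The function name `mask_tokens` and its parameters are hypothetical, not taken from any library. The ids in the usage lines (101, 102, 103, vocabulary size 30522) are the `bert-base-uncased` values and are assumptions here. The value -100 is the default `ignore_index` of PyTorch's `CrossEntropyLoss`, so positions labelled -100 contribute nothing to the loss.

```python
import random

IGNORE_INDEX = -100  # default ignore_index of torch.nn.CrossEntropyLoss


def mask_tokens(input_ids, mask_id, vocab_size, special_ids=(),
                select_prob=0.15, rng=None):
    """Hypothetical helper: return (corrupted_ids, labels) for MLM training."""
    rng = rng or random.Random()
    corrupted = list(input_ids)
    labels = [IGNORE_INDEX] * len(input_ids)  # start with every position ignored
    for i, tok in enumerate(input_ids):
        # Skip special tokens and positions that are not selected.
        if tok in special_ids or rng.random() >= select_prob:
            continue
        # Every selected position gets its original id as the label,
        # whatever happens to the input below.
        labels[i] = tok
        r = rng.random()
        if r < 0.8:
            corrupted[i] = mask_id                    # 80%: replace with [MASK]
        elif r < 0.9:
            corrupted[i] = rng.randrange(vocab_size)  # 10%: random token id
        # remaining 10%: input left unchanged, label still set
    return corrupted, labels


ids = [101, 2023, 2003, 1037, 7953, 102]  # assumed bert-base-uncased ids
corrupted, labels = mask_tokens(ids, mask_id=103, vocab_size=30522,
                                special_ids=(101, 102))
print(corrupted)
print(labels)
```

In this sketch the corrupted and unchanged tokens need no manual handling: they are in the objective because their label is the original id, while every unselected position keeps -100 and is skipped by the loss. For simplicity the random replacement draws from the whole vocabulary, so it can occasionally pick a special token or the original token itself.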
