Are the weights of the maskedLM head of the `BertForMaskedLM` model pre-trained?

h56cho · October 19, 2020, 4:47pm

Hello,
Are the weights of the maskedLM output head of the BertForMaskedLM model pre-trained?
Or are the weights of the maskedLM output head randomly initialized each time the model is called?

Thank you,

Topic		Replies	Views
BertForMaskedLM model require fine-tuning? Beginners	0	645	August 7, 2022
Why aren't all weights of BertForPreTraining initialized from the model checkpoint? Beginners	3	1592	October 5, 2021
DebertaForMaskedLM cannot load the parameters in the MLM head from microsoft/deberta-base Models	3	1324	April 29, 2022
Continual pre-training vs. Fine-tuning a language model with MLM 🤗Transformers	5	8700	November 30, 2021
Questions on the `BertModelLMHeadModel` 🤗Transformers	7	6246	October 5, 2020

Are the weights of the maskedLM head of the `BertForMaskedLM` model pre-trained?

Related topics