I am trying to train BERT for masked language modeling using the BertForMaskedLM model from Hugging Face Transformers.
My question is:
Is this model “empty”?
Let me explain further:
With BertForMaskedLM.from_pretrained("bert-base-uncased")
we load the pretrained “bert-base-uncased” weights and can use the model right away.
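For example (assuming the transformers library is installed):

from transformers import BertForMaskedLM

# Download and load the published pretrained weights
model = BertForMaskedLM.from_pretrained("bert-base-uncased")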
But what if I instantiate the model like this instead:
# INITIALIZE THE MODEL FROM A CONFIG (NO PRETRAINED WEIGHTS)
from transformers import BertConfig, BertForMaskedLM

# vocab_size and max_length are defined elsewhere in my script
model_config = BertConfig(
    vocab_size=vocab_size,
    max_position_embeddings=max_length,
)
model = BertForMaskedLM(config=model_config)
Am I actually instantiating BertForMaskedLM without loading the pretrained weights? Is this model “empty” (randomly initialized), as I assume?
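To make the question concrete, here is a minimal sketch of the check I have in mind (assuming torch is available, and assuming the embedding weights can be reached via model.bert.embeddings.word_embeddings.weight). If the model really is “empty”, two instances built from the same config should end up with different random weights:

import torch
from transformers import BertConfig, BertForMaskedLM

config = BertConfig()  # default BERT-base-style configuration

# Build two models from the same config; no pretrained weights are loaded
model_a = BertForMaskedLM(config)
model_b = BertForMaskedLM(config)

# Compare one weight tensor across the two instances
same = torch.equal(
    model_a.bert.embeddings.word_embeddings.weight,
    model_b.bert.embeddings.word_embeddings.weight,
)
print(same)  # I expect False if the weights are randomly initialized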