I noticed that the weight paths listed in `_tied_weights_keys` are inconsistent across the BERT model classes.
In `BertForMaskedLM` [link]:

```python
_tied_weights_keys = ["predictions.decoder.bias", "cls.predictions.decoder.weight"]
```
In `BertForPreTraining` [link]:

```python
_tied_weights_keys = ["cls.predictions.decoder.bias", "cls.predictions.decoder.weight"]
```

Note that the bias entry in `BertForMaskedLM` lacks the `cls.` prefix, while the weight entry in the same list (and both entries in `BertForPreTraining`) include it.
Questions:
- Why are there different paths for accessing the same weights?
- Is one of these paths incorrect? If so, which one should be used?
- If both are valid, what’s the rationale behind using different paths?
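
For what it's worth, here is a quick way to check which of these paths actually resolves on the module tree. This is just a sketch assuming a stock `transformers` install; the checkpoint name is only an example, and I use `get_parameter()` rather than `named_parameters()` because the latter de-duplicates tied tensors by default.

```python
# Check which candidate key paths resolve to a parameter on BertForMaskedLM.
from transformers import BertForMaskedLM

model = BertForMaskedLM.from_pretrained("bert-base-uncased")

candidates = [
    "predictions.decoder.bias",        # bias entry from BertForMaskedLM
    "cls.predictions.decoder.bias",    # bias entry from BertForPreTraining
    "cls.predictions.decoder.weight",  # weight entry common to both
]

for key in candidates:
    try:
        # get_parameter() walks the dotted path on the module tree and
        # raises AttributeError if no parameter lives at that path.
        model.get_parameter(key)
        print(f"{key}: resolves")
    except AttributeError:
        print(f"{key}: does not resolve")
```

If I read the modeling code correctly, the MLM head is attached as `self.cls` in both classes, so I would expect `predictions.decoder.bias` not to resolve and the `cls.`-prefixed paths to be the correct ones; but I may be missing a key-remapping step somewhere in the loading logic, hence the questions above.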