Should I use BertConfig? Why these output are different?

beneyal · February 11, 2022, 8:57am

Hello

The 1st and 2nd snippets load the weights of the prajjwal1/bert-tiny model (either with or without the LM head), so their outputs are the same.

The 3rd snippet only loads the config, meaning no weights are loaded, the model variable contains an untrained model, so the outputs will differ.

Topic		Replies	Views
Difference BertModel, AutoModel and AutoModelForMaskedLM 🤗Transformers	8	5090	March 9, 2025
Should I use BertModel or BertModelForLM? Beginners	2	461	February 10, 2022
Differences between Config.from_pretrained and Model.from_pretrained 🤗Transformers	1	1132	July 20, 2021
How to use AutoModel Beginners	0	2001	May 4, 2021
Comparing output of BERT model - why do two runs differ even with fixed seed? Beginners	2	654	January 18, 2022