Are Google's official BERT model and the Hugging Face BERT model different or the same?

manirai91 · March 9, 2022, 5:53am

If they are the same/equivalent, how were they created?

Are the pretrained parameters just copied?
Or were they pretrained from scratch, following exactly the same approach (objective/hyperparameters) and data mentioned in the paper?

I have been searching for this information for hours, but can’t find it anywhere.

BramVanroy · March 9, 2022, 7:38am

They have the same parameters. As you said, the parameters are copied/converted from Google's released checkpoints, not pretrained again. You’ll find that the repository contains a lot of conversion scripts for converting checkpoints between TensorFlow and PyTorch. For instance this one: `convert_bert_original_tf2_checkpoint_to_pytorch.py` in huggingface/transformers on GitHub (at commit cd56f3fe7eae4a53a9880e3f5e8f91877a78271c).
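To give a rough idea of what "converted" means here, below is a minimal sketch of the general idea, not the actual conversion script. It assumes a toy checkpoint stored as a plain dict; the helper names (`convert_name`, `convert_checkpoint`) and the renaming table are hypothetical and heavily simplified. The one concrete fact it relies on is that a TensorFlow dense `kernel` is stored as (in_features, out_features), while a PyTorch `Linear` weight is stored as (out_features, in_features), so kernels are transposed and the values themselves are otherwise left unchanged.

```python
# Illustrative sketch only: this is NOT the Hugging Face conversion script.
# Names and renaming rules are simplified assumptions for demonstration.

# Hypothetical mapping from TF-style leaf names to PyTorch-style leaf names.
LEAF_RENAMES = {"kernel": "weight", "gamma": "weight", "beta": "bias"}


def transpose(matrix):
    """Transpose a 2-D list of lists."""
    return [list(row) for row in zip(*matrix)]


def convert_name(tf_name):
    """Turn 'a/b/kernel' into 'a.b.weight' (simplified renaming)."""
    parts = tf_name.split("/")
    parts[-1] = LEAF_RENAMES.get(parts[-1], parts[-1])
    return ".".join(parts)


def convert_checkpoint(tf_vars):
    """Copy every value, renaming keys and transposing dense kernels."""
    pt_state = {}
    for tf_name, value in tf_vars.items():
        # TF dense kernels are (in, out); PyTorch Linear weights are (out, in).
        if tf_name.endswith("/kernel"):
            value = transpose(value)
        pt_state[convert_name(tf_name)] = value
    return pt_state


# Toy stand-in for a TF checkpoint: a 2x3 kernel and a length-3 bias.
toy_tf_checkpoint = {
    "bert/pooler/dense/kernel": [[1.0, 2.0, 3.0], [4.0, 5.0, 6.0]],
    "bert/pooler/dense/bias": [0.1, 0.2, 0.3],
}

converted = convert_checkpoint(toy_tf_checkpoint)
for name, value in converted.items():
    print(name, value)
```

The point is that no training happens in this step: every number in the output comes straight from the input checkpoint, only renamed and, for kernels, transposed.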
