Tensorflow model.summary() doesn't show detail of TFBertModel

rgwatwormhill · August 20, 2020, 6:49pm

I loaded a transformers bert model into tensorflow, using

from transformers import BertTokenizer, TFBertModel

model_version = 'bert-base-uncased'
do_lower_case = True
model = TFBertModel.from_pretrained(model_version, output_attentions=True)
tokenizer = BertTokenizer.from_pretrained(model_version, do_lower_case=do_lower_case)

This seemed to work, but TensorFlow's model.summary() shows much less detail than I expected:

Model: "tf_bert_model"
Layer (type)                 Output Shape              Param #
bert (TFBertMainLayer)       multiple                  109482240
Total params: 109,482,240
Trainable params: 109,482,240
Non-trainable params: 0

Has the model been loaded correctly?
Do I need to define the config, even though it is loading a predefined model?
Is there any way to get TensorFlow to show what is inside the TFBertMainLayer?

(When I loaded a similar bert-base model using keras-bert, TensorFlow's model.summary() showed a lot more detail.)

This is on Colab, with:
tensorflow v 2.3.0
transformers v 3.0.2
torch v 1.6.0+cu101
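
On the last question, one possible workaround (a rough sketch, not the library's own method) is to skip model.summary() and group the model's weights by the leading parts of their variable names, since Keras exposes every variable through model.weights. The helper name param_counts_by_prefix, the FakeWeight stand-in, and the example variable names and shapes below are all made up for illustration, so the snippet runs without downloading anything; the real names in a loaded model may differ.

```python
from collections import OrderedDict, namedtuple


def param_counts_by_prefix(weights, depth=3):
    """Sum parameter counts over weights that share the first `depth` name parts.

    `weights` is any iterable of objects with a `.name` string and an
    iterable `.shape` (Keras variables fit this description).
    """
    counts = OrderedDict()
    for w in weights:
        # Variable names look like "scope/sub_scope/.../kernel:0".
        parts = w.name.split(":")[0].split("/")
        prefix = "/".join(parts[:depth])
        n = 1
        for dim in w.shape:
            n *= int(dim)
        counts[prefix] = counts.get(prefix, 0) + n
    return counts


# Hypothetical stand-ins for real variables (names and shapes are illustrative).
FakeWeight = namedtuple("FakeWeight", ["name", "shape"])
demo_weights = [
    FakeWeight("tf_bert_model/bert/embeddings/word_embeddings/weight:0", (30522, 768)),
    FakeWeight("tf_bert_model/bert/embeddings/LayerNorm/gamma:0", (768,)),
    FakeWeight("tf_bert_model/bert/pooler/dense/kernel:0", (768, 768)),
    FakeWeight("tf_bert_model/bert/pooler/dense/bias:0", (768,)),
]

demo_counts = param_counts_by_prefix(demo_weights, depth=3)
for prefix, n in demo_counts.items():
    print("{:<40} {:>12,}".format(prefix, n))
```

With the model loaded as above, the same helper could presumably be called as param_counts_by_prefix(model.weights, depth=3), with a larger depth giving a finer breakdown.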
