How to get a model's initial input representation?

yabramuvdi · June 16, 2022, 10:06am

Hi! I am trying to recover the initial input representation that BERT generates for each token in a given sequence. I know this initial representation is made by adding the token embedding with the positional and the segment embedding (see picture below from the paper). I also know these individual embeddings (token, positional and segment) can be accessed through the model. However, is there anyway I can extract the complete initial input representation from the output of the model?

I am currently just doing this and have not been able to find this representation anywhere in the output.

# load the model
model = BertModel.from_pretrained('bert-base-uncased',
                                   output_hidden_states = True,
                                   output_attentions = True 
                                  )

# define the sentence and tokenize it
sent = "my dog is cute"
encoded_sent = tokenizer(sent, return_tensors="pt")

# pass the sentence through the model
output = model(**encoded_sent)

Thanks!

dblakely · June 17, 2022, 8:46pm

It doesn’t output the embeddings, but you could make a custom subclass of BertModel that does. For example, you could override this method and subclass the BaseModelOutputWithPoolingAndCrossAttentions class to extract the embeddings.

yabramuvdi · June 21, 2022, 8:24am

Hi Derrick! Great idea. Thanks a lot for the recommendation. I will implement this.

Topic		Replies	Views
What's the input of BERT? Beginners	4	9761	December 23, 2022
What should be used as sentence embedding for BertModel? Beginners	0	1911	May 24, 2021
How to get [CLS] embeddings from BertForTokenClassification model Beginners	3	15236	November 27, 2023
Bert embedding layer Beginners	1	2750	January 4, 2022
Accessing uncontextualized BERT word embeddings Beginners	2	1504	October 30, 2020

How to get a model's initial input representation?

Related topics