Accessing uncontextualized BERT word embeddings

Hi there! Once I’ve imported a BERT model from HuggingFace, is there a way to convert a sequence of encoded tokens into BERT’s raw embeddings without contextualizing them using self-attention, or otherwise extract the raw embedding for a given token?


Try this.
I think the APIs change a bit between models, so take a look before you copy-paste 🙂

    from transformers import AutoTokenizer, DistilBertForTokenClassification

    tokenizer = AutoTokenizer.from_pretrained("distilbert-base-cased")
    # num_labels is arbitrary here; we only use the embedding layer
    model = DistilBertForTokenClassification.from_pretrained("distilbert-base-cased", num_labels=2)
    input_ids = tokenizer("my text here", return_tensors="pt")["input_ids"]
    # Raw token embeddings from the lookup table (no self-attention applied)
    word_embeddings = model.distilbert.embeddings.word_embeddings(input_ids)
    # Token embeddings with position embeddings added (still uncontextualized)
    word_embeddings_with_positions = model.distilbert.embeddings(input_ids)
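If you'd rather not hard-code the `distilbert` attribute path, `get_input_embeddings()` is the generic accessor available on any `transformers` model, so the same approach carries over across architectures. A minimal sketch (the token string "hello" is just an example, and it assumes the `model` and `tokenizer` from above):

    # Generic accessor: returns the nn.Embedding lookup table for input tokens
    embedding_layer = model.get_input_embeddings()
    # Raw, uncontextualized embedding vector for a single token
    token_id = tokenizer.convert_tokens_to_ids("hello")
    single_token_embedding = embedding_layer.weight[token_id]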

This got me there! Thank you so much.