Export the embeddings layer of a pre-trained model as a standalone model

Mihail · January 31, 2024, 1:48pm

I would like to import a pre-trained model, extract the embeddings layer and export that as a standalone model. I did something like this:

from transformers import AutoTokenizer, AutoModelForPreTraining

tokenizer = AutoTokenizer.from_pretrained("deepset/gelectra-large")
model = AutoModelForPreTraining.from_pretrained("deepset/gelectra-large")

embedding_model = model.electra.embeddings

embedding_model.save_pretrained("...")

This raises an AttributeError saying the object has no save_pretrained method. Any ideas how this is done?

One idea that came to mind is to wrap the extracted embeddings layer in a new model object, but I'm not sure what config to use for its initialisation.

Mihail · January 31, 2024, 2:47pm

Solved. I just used the save methods from PyTorch on the extracted embedding objects.
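
For anyone landing here, a minimal sketch of what that could look like. The post does not say which PyTorch save method was used, so this assumes torch.save on the layer's state_dict, with the config saved alongside so the layer can be rebuilt later. To keep the sketch runnable without a large download, a tiny randomly initialised ElectraConfig stands in for deepset/gelectra-large; the file name embeddings.pt and the config sizes are illustrative choices, not anything from the original post.

```python
import os
import tempfile

import torch
from transformers import ElectraConfig, ElectraForPreTraining
from transformers.models.electra.modeling_electra import ElectraEmbeddings

# Assumption: a tiny, randomly initialised config stands in for the real
# checkpoint so this runs without a download. With the real model, load it
# via from_pretrained("deepset/gelectra-large") as in the question.
config = ElectraConfig(
    vocab_size=100,
    embedding_size=16,
    hidden_size=32,
    num_hidden_layers=1,
    num_attention_heads=2,
    intermediate_size=64,
    max_position_embeddings=32,
)
model = ElectraForPreTraining(config)

# The embeddings layer is a plain torch.nn.Module, so it has no
# save_pretrained method, but the usual PyTorch save/load calls work.
embedding_model = model.electra.embeddings.eval()

with tempfile.TemporaryDirectory() as tmp_dir:
    # Save the weights and the config needed to rebuild the layer.
    weights_path = os.path.join(tmp_dir, "embeddings.pt")  # illustrative name
    torch.save(embedding_model.state_dict(), weights_path)
    config.save_pretrained(tmp_dir)

    # Rebuild a standalone embeddings layer from the saved files.
    restored_config = ElectraConfig.from_pretrained(tmp_dir)
    restored = ElectraEmbeddings(restored_config)
    restored.load_state_dict(torch.load(weights_path))
    restored.eval()

# Check that both layers produce the same output for the same token ids.
input_ids = torch.tensor([[1, 2, 3, 4]])
with torch.no_grad():
    original_out = embedding_model(input_ids=input_ids)
    restored_out = restored(input_ids=input_ids)
```

Saving the state_dict rather than pickling the whole module keeps the file independent of the exact class layout, at the cost of needing the config to reconstruct the layer.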
