I’m trying to modify the GPT2 model in the following way:
Instead of running sentences through the tokenizer and feeding the tokenizer output to the model, I want to feed my own externally defined embeddings directly into the model. That is, I want to take GPT2's 12 pretrained transformer layers and pass my embedding tensor straight through them. I saw that the layers can be extracted via `gpt.base_model.h`, so I tried the following (simplified) code:
```python
import torch
from torch import nn
from transformers import AutoModel

gpt = AutoModel.from_pretrained('gpt2')
model = nn.Sequential(*gpt.base_model.h)  # the 12 GPT2Block layers
model(torch.ones((1, 5, 768)))
```
But I get an error:

```
TypeError: layer_norm(): argument 'input' (position 1) must be Tensor, not tuple
```

Has anyone encountered this scenario and knows how to fix it?
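From the traceback, the issue appears to be that each `GPT2Block` returns a tuple (with the hidden states as its first element), so `nn.Sequential` ends up feeding a tuple into the next block's `LayerNorm`. Below is a minimal sketch with toy blocks (not the real GPT2 layers, just stand-ins with the same tuple-returning behavior) that reproduces the same kind of failure and shows that unpacking the tuple between blocks avoids it:

```python
import torch
from torch import nn

class ToyBlock(nn.Module):
    """Stand-in for a transformer block that returns a tuple, like GPT2Block."""

    def __init__(self, dim=768):
        super().__init__()
        self.ln = nn.LayerNorm(dim)

    def forward(self, hidden_states):
        # Returns a tuple, not a bare tensor -- this is what breaks nn.Sequential.
        return (self.ln(hidden_states),)

blocks = nn.Sequential(ToyBlock(), ToyBlock())
try:
    blocks(torch.ones(1, 5, 768))
except TypeError as e:
    # Same failure mode: the second block's LayerNorm receives a tuple.
    print(e)

# Unpacking the tuple between blocks works:
x = torch.ones(1, 5, 768)
for block in [ToyBlock(), ToyBlock()]:
    x = block(x)[0]  # take the hidden states out of the tuple
print(x.shape)  # torch.Size([1, 5, 768])
```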