I am using GPT-2 as the text generator for a video captioning model, so instead of feeding GPT-2 token ids, I pass the video embeddings directly via the inputs_embeds parameter.
Now, during inference, I'm trying to use GPT-2's .generate() function to get the predicted sentences as output, but it seems to accept only token ids as input. Is there a way to pass it the embeddings directly?
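For context, here is a minimal sketch of what I'm doing at train time. It uses a tiny randomly initialized GPT-2 config so it runs standalone (in my actual model I load the pretrained checkpoint), and the random tensor stands in for my video encoder's output:

```python
import torch
from transformers import GPT2Config, GPT2LMHeadModel

# Tiny random config so this sketch runs without any downloads;
# in the real model I use the pretrained GPT-2 weights.
config = GPT2Config(n_layer=2, n_head=2, n_embd=64, vocab_size=100)
model = GPT2LMHeadModel(config)
model.eval()

# Placeholder for my video encoder output: (batch, num_frames, hidden_size).
video_embeds = torch.randn(1, 5, config.n_embd)

# Forward pass works fine with embeddings instead of token ids.
with torch.no_grad():
    out = model(inputs_embeds=video_embeds)

print(out.logits.shape)  # one logit row per input embedding
```

This forward call accepts inputs_embeds without complaint; it's only .generate() where I can't find the equivalent.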