GPT2.generate() with custom inputs_embeds argument returning tensor (1*max_length) instead of (batch_size*max_length)

Hi everybody!

I am trying to get my GPT-2 model to generate sequences from custom input embeddings. The model does output sequences, but when I pass a tensor of e.g. shape (3, 314, 1280), it only returns a LongTensor of shape (1, max_length) instead of (3, max_length).

CODE for reproducing:

import torch
from transformers import AutoModelForCausalLM

MOD = AutoModelForCausalLM.from_pretrained(filepath_GPT2)

Out = MOD.generate(
    inputs_embeds=torch.rand(3, 314, 1280),
    max_length=40,
    temperature=1.0,
    repetition_penalty=1.2,
    top_k=950,
    num_return_sequences=1,
    do_sample=True,
    top_p=1.0,
)

print(Out.shape)
# torch.Size([1, 40])

According to the model.generate documentation, the output should be:

        :obj:`torch.LongTensor` of shape :obj:`(batch_size * num_return_sequences, sequence_length)`:
        The generated sequences. The second dimension (sequence_length) is either equal to :obj:`max_length` or
        shorter if all batches finished early due to the :obj:`eos_token_id`.

However, that is not what I get.
I already checked the source code for the inputs_embeds handling… as far as I can tell there should be no issue there.
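For comparison, here is a minimal self-contained check of the behaviour I would expect, using a tiny randomly initialized GPT-2 (the small config values are just placeholders, no pretrained weights involved) so it runs without downloading anything. With a batch of 3 embedded prompts, I would expect the batch dimension of the generated tensor to be 3:

```python
import torch
from transformers import GPT2Config, GPT2LMHeadModel

# Tiny randomly initialized GPT-2; these config values are arbitrary,
# chosen only so the example runs quickly without pretrained weights.
config = GPT2Config(n_layer=2, n_head=2, n_embd=64, vocab_size=128)
model = GPT2LMHeadModel(config)
model.eval()

# Batch of 3 embedded prompts; last dim must match n_embd=64
embeds = torch.rand(3, 10, 64)

out = model.generate(inputs_embeds=embeds, max_new_tokens=5, do_sample=True)

# I would expect the first dimension to be 3 (the batch size),
# per the generate docstring quoted above.
print(out.shape)
```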

Thanks for the help in advance!