Output Includes Input

kurbster · June 15, 2021, 3:24am

Whenever I am generating text the input is included in the output. When the input is close to the maximum length the model barely produces any useful output.

Information

When using transformers.pipeline or transformers.from_pretrianed, the model is only generating the input, when the input is long. For example,
generator = transformers.pipeline('text-generation', model='gpt2')
prompt = "really long text that is 1023 tokens ..."
output = generator(prompt, mex_length=1024, do_sample=True, temperature=0.9)
output in this case would be equal to the input prompt.

To Reproduce

Here is a Collab notebook with simple examples of the problem. I am looking to generate output from input ~1300 tokens and running into this issue consistently. Is there a way around this?

Skylixia · June 16, 2021, 11:09am

I am experiencing the same issue with another model … Help with this would be appreciated

EstebanSir · November 19, 2021, 2:38pm

I have the same problem with GPTJ, did you manage to fix this?

yanming · September 29, 2022, 2:26am

The same issue for me. Had you solved it? : )

Topic		Replies	Views
Output token lengths of smaller models 🤗Transformers	0	499	October 30, 2023
How to change max_length of a fine tuned model 🤗Transformers	4	11423	May 11, 2024
Pipeline max_length 🤗Transformers	2	3881	February 23, 2024
Model.generate generates way too long outputs 🤗Transformers	0	311	September 9, 2023
Text generation max length 🤗Hub	1	3095	October 15, 2023

Output Includes Input

Information

To Reproduce

Related topics