Hi,
I am running inference with the following models: "NousResearch/Llama-2-7b-chat-hf", "NousResearch/Llama-2-7b-hf", and "lmsys/vicuna-7b-v1.5". To extract the text generated by each LLM, I use `model_response = sequences[0]['generated_text']`.
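
For reference, here is a minimal sketch of my setup, assuming the standard transformers text-generation pipeline; the model name, prompt, and generation parameters below are placeholders rather than my exact values:

```python
import torch
import transformers

model_id = "NousResearch/Llama-2-7b-chat-hf"  # same pattern for the other two models

pipe = transformers.pipeline(
    "text-generation",
    model=model_id,
    torch_dtype=torch.float16,
    device_map="auto",
)

prompt = "What is the capital of France?"
sequences = pipe(prompt, max_new_tokens=128, do_sample=True)

# This string currently contains the prompt followed by the generation.
model_response = sequences[0]["generated_text"]
print(model_response)
```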
The response I get back from all three models always contains the input prompt followed by the model's generated text. Is there an inference setting that would make the returned text contain only the model's generated text, without the prompt?