Text generation model produces incomplete responses although max_new_tokens is set to a high value

I am using LangChain with the HuggingFaceHub LLM wrapper:
from langchain.llms import HuggingFaceHub

model = HuggingFaceHub(
    repo_id="mistralai/Mistral-7B-Instruct-v0.2",
    task="text-generation",
    model_kwargs={
        "max_length": 1024,
        "max_new_tokens": 512,
        "top_k": 30,
        "temperature": 0.1,
        "repetition_penalty": 1.03,
    },
)
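
The model is then invoked along these lines (a minimal sketch; the prompt text is only an illustrative placeholder, and HUGGINGFACEHUB_API_TOKEN is assumed to be set in the environment):

# Minimal invocation sketch -- the prompt below is illustrative only.
prompt = "[INST] Summarize the plot of a short story in a few paragraphs. [/INST]"

# HuggingFaceHub implements LangChain's LLM interface and returns a plain string.
# (On older LangChain releases the call style is model(prompt) instead of model.invoke(prompt).)
response = model.invoke(prompt)
print(response)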

The model generates incomplete responses, or the generation stops midway through.