LLama2-7b QA gives unwanted characters in text_output during inference

deecode · November 7, 2024, 8:24am

I fine-tuned llama2-ko-7b with LoRA to answer questions based on a given context.

My training data was a JSONL file containing multiple text records:

Example: { "text": "~~### Instruction:\n{question}\n\n### Input:\n{context}\n\n### Response:\n{answer}.~~" }
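For reference, the record format above can be sketched in Python as follows. The function names are hypothetical, and the optional `eos_token` argument is an assumption: the post does not say whether an end-of-sequence token was appended to each training text. `</s>` is assumed here to be the tokenizer's EOS string, matching the `end_id` of 2 used in the request further down. The trailing period shown after `{answer}` in the example is treated as part of the answer.

```python
import json

# Template copied from the training example in the post.
TEMPLATE = "### Instruction:\n{question}\n\n### Input:\n{context}\n\n### Response:\n"

def build_inference_prompt(question, context):
    """Prompt for inference: the training text up to and including the response header."""
    return TEMPLATE.format(question=question, context=context)

def build_training_text(question, context, answer, eos_token=""):
    """Full training string: prompt, answer, and an optional EOS marker (assumption)."""
    return build_inference_prompt(question, context) + answer + eos_token

def to_jsonl_line(text):
    """Serialize one record; ensure_ascii=False keeps Korean text readable in the file."""
    return json.dumps({"text": text}, ensure_ascii=False)

# "</s>" is assumed to be the EOS string of the tokenizer in use.
record = build_training_text("Q?", "C.", "A.", eos_token="</s>")
print(to_jsonl_line(record))
```

If the training texts ended without an EOS token, the model may never have seen an explicit signal to stop after the answer. That would be consistent with the trailing output described below, though the post does not confirm it.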

The model was trained for 20 epochs, and I am trying to run inference on a Triton server.

I am running into a problem with the output text.

After the first sentence, the output always contains extra text such as \n\n\n, ###, or Input.

I tried the following request parameters:
{
  "max_tokens": 30,
  "bad_words": ["\n\n###", "###"],
  "stop_words": ["\n\n###", ".", "!"],
  "pad_id": 2,
  "end_id": 2,
  "streaming": 1,
  "early_stopping": true,
  "temperature": 1.0,
  "top_k": 50,
  "top_p": 0.92,
  "no_repeat_ngram_size": 3,
  "eos_token_id": 2,
  "num_beams": 1,
  "do_sample": true
}
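Example output: "text_output": "경관계획은 실시설계를 완료하기 전에 수립해야 합니다. \n\n##\n\n \t\n\n \t\n\n \t"
(In English, the sentence reads: "The landscape plan must be established before the detailed design is completed.")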

Q: How can I prevent this issue during inference?
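One client-side workaround, independent of the server settings, is to trim the returned `text_output` at the first unwanted marker. The sketch below is a minimal illustration, not a fix for the underlying generation behavior; `trim_output` and `STOP_MARKERS` are hypothetical names. Note that the example output contains `##` rather than `###`, so a stop sequence of `###` alone would not match it, which is why a blank-line marker is included here.

```python
# Markers based on the post: blank-line runs and the template's section header.
STOP_MARKERS = ("\n\n", "###")

def trim_output(text_output, markers=STOP_MARKERS):
    """Cut the text at the earliest stop marker and strip trailing whitespace."""
    cut = len(text_output)
    for marker in markers:
        idx = text_output.find(marker)
        if idx != -1:
            cut = min(cut, idx)
    return text_output[:cut].rstrip()

raw = "경관계획은 실시설계를 완료하기 전에 수립해야 합니다. \n\n##\n\n \t\n\n \t\n\n \t"
print(trim_output(raw))  # prints the first sentence only
```

Cutting at the first blank line assumes answers are a single paragraph, as in the example above; multi-paragraph answers would need a different set of markers.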
