Packing issue, SFTTrainer

Hi,
I’m fine-tuning llama-v2 using SFTTrainer. When I set packing=False my model overall performance gets better but on inference it just cant stop, it generates words until it hits max_new_tokens. This as he could not learn eos token…
Does anyone have an idea what could be causing this?

Thanks
Tomek