When I try to use my fine-tuned causal LM to run inference on a prompt, I get nothing but the last word repeated multiple times

It might be the way the model was fine-tuned (how the dataset is structured, how the data is processed by the data collator and trainer, etc.), but repetition like the example you've provided also seems to be a fairly common occurrence. There's a `repetition_penalty` parameter you can try toying with; see the Transformers documentation on the `repetition_penalty` generation parameter.
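
For reference, here's a minimal sketch of passing `repetition_penalty` (and a couple of related decoding knobs) to `generate()`. The model path and prompt below are placeholders, so swap in your own:

```python
from transformers import AutoModelForCausalLM, AutoTokenizer

# Placeholder path; point this at your fine-tuned checkpoint
model_path = "./my-finetuned-model"
model = AutoModelForCausalLM.from_pretrained(model_path)
tokenizer = AutoTokenizer.from_pretrained(model_path)

inputs = tokenizer("Once upon a time", return_tensors="pt")
outputs = model.generate(
    **inputs,
    max_new_tokens=100,
    repetition_penalty=1.2,  # values > 1.0 penalize tokens that have already appeared
    do_sample=True,          # sampling often reduces degenerate repetition vs. greedy decoding
    temperature=0.7,
)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```

If the penalty alone doesn't help, `no_repeat_ngram_size` is another `generate()` parameter worth experimenting with.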

Out of curiosity, is there a particular reason you’re trying to generate English text with a model whose base language is Chinese?