Why is repetition_penalty implemented using the full context rather than only the generated tokens?

characta · June 23, 2023, 7:14am

In the paper, repetition_penalty is calculated using only the generated tokens. But in the RepetitionPenaltyLogitsProcessor implementation, input_ids (context + generated tokens) is used. Why?
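
To make the difference concrete, here is a minimal pure-Python sketch of the two behaviours. It assumes the penalty rule divides positive scores by the penalty and multiplies negative scores by it, which is how the transformers processor appears to work. The helper `apply_repetition_penalty`, the token ids, and the scores are all hypothetical values chosen for illustration, not library code.

```python
def apply_repetition_penalty(scores, penalized_ids, penalty):
    """Return a copy of `scores` with the penalty applied to each id in `penalized_ids`.

    Hypothetical helper: positive scores are divided by `penalty`,
    negative scores are multiplied by it, so both become less likely.
    """
    out = list(scores)
    for token_id in set(penalized_ids):  # each token is penalized once
        if out[token_id] < 0:
            out[token_id] *= penalty
        else:
            out[token_id] /= penalty
    return out


# Toy setup (made-up numbers): vocabulary of 4 tokens.
prompt_ids = [0, 1]        # tokens that came from the context/prompt
generated_ids = [2]        # tokens produced so far by the model
scores = [2.0, -1.0, 4.0, 3.0]
penalty = 2.0

# Behaviour in question: penalize everything in input_ids (context + generated).
full_context = apply_repetition_penalty(scores, prompt_ids + generated_ids, penalty)

# Behaviour described in the paper: penalize only the generated tokens.
generated_only = apply_repetition_penalty(scores, generated_ids, penalty)

print(full_context)    # prompt tokens 0 and 1 are penalized too
print(generated_only)  # only token 2 is penalized
```

With the full context, tokens that appear only in the prompt also have their scores lowered, so the model is discouraged from reusing words from the prompt. With generated tokens only, the prompt tokens keep their original scores.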

