In the paper, the repetition penalty is calculated using only the generated tokens. But the RepetitionPenaltyLogitsProcessor implementation applies it to all of input_ids (context + generated tokens). Why?
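To make the difference concrete, here is a minimal sketch in plain Python. It is not the library's code: `apply_repetition_penalty` is a hypothetical helper, the logits and token ids are made up, and the sign-aware rule (divide positive logits, multiply negative ones) is an assumption about how the penalty is applied. The only thing that changes between the two calls is which token ids get penalized.

```python
def apply_repetition_penalty(logits, penalized_ids, penalty):
    """Return a copy of `logits` with the penalty applied to each id in `penalized_ids`.

    Hypothetical helper for illustration only, not the library implementation.
    """
    out = list(logits)
    for tok in set(penalized_ids):
        # Assumed sign-aware rule: divide positive logits, multiply negative
        # ones, so a penalty > 1 always lowers the token's score.
        out[tok] = out[tok] * penalty if out[tok] < 0 else out[tok] / penalty
    return out


logits = [2.0, -1.0, 4.0, 0.5]  # toy vocabulary of 4 tokens
prompt_ids = [0, 1]             # context tokens
generated_ids = [2]             # tokens generated so far
penalty = 2.0

# Paper-style: only generated tokens are penalized.
paper_style = apply_repetition_penalty(logits, generated_ids, penalty)

# Implementation-style: everything in input_ids (context + generated) is penalized.
impl_style = apply_repetition_penalty(logits, prompt_ids + generated_ids, penalty)

print(paper_style)
print(impl_style)
```

With the paper-style call, tokens 0 and 1 from the context keep their original scores. With the implementation-style call they are penalized as well, so the model is also discouraged from repeating words that appear in the prompt.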