Force decoder to avoid repetition between generated sentences

mengyahu · August 7, 2020, 3:31am

I would like to fine-tune T5 for diverse paraphrase generation. For each original sentence, I would like to have several different paraphrases generated, but current results contain sentences very similar to each other.

Example:
Original Question ::
What is the expected close date of the opportunity
Paraphrased Questions Generated by T5::
0: What will be the expected close date of the opportunity?
1: What is the expected closing date for the opportunity that you are considering?
2: What is the expected close date of the opportunity?
3: What is the expected close date on the opportunity?
4: When would be the expected close date of the opportunity?

I tried to add a diversity measure during training but was told it wouldn’t work.

So I want to directly force the decoder to avoid repeating n-grams between the generated sentences at test time. The ‘generate’ function has two relevant parameters: repetition_penalty and no_repeat_ngram_size. I checked the paper and the source code and, if I understand correctly, they only prevent repetition within a single beam, not between the output sentences. Unsurprisingly, I tried different values for both parameters and saw no effect.
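To illustrate why an n-gram ban of this kind has no effect across outputs, here is a minimal pure-Python sketch of the idea. It is not the library's implementation: `banned_next_tokens` is a hypothetical helper, and token ids are made up. The point is only that the ban is computed from one hypothesis's own history.

```python
from typing import List, Set


def banned_next_tokens(prev_tokens: List[int], ngram_size: int) -> Set[int]:
    """Return the tokens that would complete an n-gram already in prev_tokens.

    Hypothetical helper, for illustration only: the ban depends solely on
    the history of ONE hypothesis, never on what other beams contain.
    """
    if ngram_size < 1 or len(prev_tokens) + 1 < ngram_size:
        return set()
    # The last (n - 1) tokens are the prefix the next token would extend.
    prefix = tuple(prev_tokens[len(prev_tokens) - ngram_size + 1:])
    banned: Set[int] = set()
    for i in range(len(prev_tokens) - ngram_size + 1):
        ngram = prev_tokens[i:i + ngram_size]
        if tuple(ngram[:-1]) == prefix:
            banned.add(ngram[-1])
    return banned


# Two hypotheses in the same beam search (made-up token ids).
beam_a = [1, 2, 3, 1, 2]   # already contains the trigram (1, 2, 3)
beam_b = [1, 2]            # has not produced any trigram yet

banned_a = banned_next_tokens(beam_a, ngram_size=3)  # token 3 is blocked here
banned_b = banned_next_tokens(beam_b, ngram_size=3)  # nothing is blocked here
```

Because each hypothesis is checked in isolation, `beam_b` remains free to emit the same trigram that `beam_a` already contains, which matches the behaviour described above.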

So I was wondering whether there is a simple way to penalize repetition between sentences. My idea is, during beam search, to penalize the probabilities of words that already appear on other branches at the same or a nearby step. Is there open-source code available for this? If not, is there anything I need to pay attention to when modifying the ‘generate()’ function for this?
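The idea in the last paragraph can be sketched in a few lines of plain Python. This is a rough illustration under stated assumptions, not existing library code: `penalize_shared_tokens`, the `penalty` value, and the toy log-probabilities are all hypothetical, and a real implementation would operate on score tensors inside the beam search loop.

```python
from collections import Counter
from typing import Dict, List


def penalize_shared_tokens(
    step_logprobs: Dict[str, float],
    tokens_on_other_branches: List[str],
    penalty: float = 1.0,
) -> Dict[str, float]:
    """Lower the score of tokens other branches already chose at this step.

    Hypothetical helper: each occurrence of a token on another branch
    subtracts `penalty` from that token's log-probability.
    """
    counts = Counter(tokens_on_other_branches)
    return {tok: lp - penalty * counts[tok] for tok, lp in step_logprobs.items()}


# Toy scores for the current branch at one decoding step (made-up numbers).
logprobs = {"close": -0.2, "closing": -1.0, "end": -2.5}

# Two other branches have already picked "close" at this step.
adjusted = penalize_shared_tokens(logprobs, ["close", "close"], penalty=0.5)
best = max(adjusted, key=adjusted.get)
```

With these toy numbers the penalized branch prefers "closing" over "close". Choosing `penalty` is a trade-off: too small and the branches still converge, too large and fluency suffers.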

patrickvonplaten · November 2, 2020, 1:12pm

Hey @mengyahu - this sounds like a cool use case! You are right: with the current generate() method it is not really possible to avoid repetitions between sentences. It’s quite a special case, so I’d suggest that after this PR is merged (Big `generate()` refactor) you make a fork of the transformers repo and try to tweak the beam_scorer.update() function (or the BeamSearchScorer class in general) to add a penalty as needed.

kmfoda · April 16, 2021, 10:59am

Hey @mengyahu. I’m facing a similar issue where I’m getting repeated sentences in summaries I’m looking to produce. Did you get a chance to add this penalty for repeated sentences? Happy to help work on it if not.

mengyahu · April 16, 2021, 5:07pm

@kmfoda I have not found a way to add that penalty yet. I moved on to other projects soon after I posted the question, so I did not check whether the PR mentioned by @patrickvonplaten was merged.
It would be great if you continue working on this and post a solution if you find one.

kmfoda · April 20, 2021, 9:50am

Thanks @mengyahu. Actually for my use case I found that no_repeat_ngram_size worked great because I was looking to avoid sentence repetitions in the same single output. I’m guessing you want to avoid repetitions across the multiple outputs produced. Let me have a think about how that might be done and if I make some progress I’ll submit a PR.
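For repetition across multiple outputs, one workaround that needs no changes to decoding is to over-generate and filter afterwards. Note that this is a different technique from the decoding-time penalty discussed above: a post-hoc filter based on n-gram overlap. The sketch below is an assumption-laden illustration; `select_diverse`, the bigram choice, and the `max_overlap` threshold are hypothetical and would need tuning.

```python
from typing import List, Set, Tuple


def ngrams(text: str, n: int = 2) -> Set[Tuple[str, ...]]:
    """Lower-cased word n-grams, ignoring trailing sentence punctuation."""
    words = text.lower().rstrip("?.!").split()
    return {tuple(words[i:i + n]) for i in range(len(words) - n + 1)}


def jaccard(a: Set[Tuple[str, ...]], b: Set[Tuple[str, ...]]) -> float:
    """Jaccard similarity of two n-gram sets (1.0 if both are empty)."""
    if not a and not b:
        return 1.0
    return len(a & b) / len(a | b)


def select_diverse(candidates: List[str], max_overlap: float = 0.5, n: int = 2) -> List[str]:
    """Greedily keep candidates whose overlap with every kept one is low."""
    kept: List[str] = []
    for cand in candidates:
        grams = ngrams(cand, n)
        if all(jaccard(grams, ngrams(k, n)) <= max_overlap for k in kept):
            kept.append(cand)
    return kept


# The paraphrases from the first post, in the order they were generated.
paraphrases = [
    "What will be the expected close date of the opportunity?",
    "What is the expected closing date for the opportunity that you are considering?",
    "What is the expected close date of the opportunity?",
    "What is the expected close date on the opportunity?",
    "When would be the expected close date of the opportunity?",
]
print(select_diverse(paraphrases, max_overlap=0.5))
```

The filter is greedy, so the result depends on candidate order; sorting candidates by model score first would keep the highest-scoring sentence from each cluster of near-duplicates.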
