How can I use diverse beam-search? (it isn't working in my code)

minji · October 26, 2022, 11:54am

Hello, I want to get several promising generated candidates using diverse beam search decoding.
But when I run the code below, every returned sequence is the same sentence.

sample_outputs = self.model.generate(
    input_ids=input_ids,
    max_length=args.max_input_length,
    num_beams=6,
    num_beam_groups=3,
    num_return_sequences=3,
    pad_token_id=self.tokenizer.pad_token_id,
    eos_token_id=self.tokenizer.eos_token_id,
)

If num_return_sequences is set to the same value as num_beam_groups, shouldn't one sentence be returned from each group?

However, all of the returned sentences are the same.

I confirmed that removing num_beam_groups returns distinct candidates.
I would appreciate it if you could tell me which argument I should add to use diverse beam search decoding.

Thanks.

sefinch · February 26, 2023, 10:43pm

Not sure if you’ve already figured this out, since this is somewhat of an old post, but I recently ran into this problem and fixed it by adding the diversity_penalty argument in .generate().
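
As a rough sketch of that fix, the keyword arguments for the call in the original post could be assembled like this. The helper name diverse_beam_kwargs and the penalty value 1.0 are my own illustrative assumptions, not part of the library. The checks reflect constraints I understand generate() to apply (num_beams divisible by num_beam_groups, a non-zero penalty), but exact behavior may differ between transformers versions.

```python
def diverse_beam_kwargs(num_beams=6, num_beam_groups=3, diversity_penalty=1.0):
    """Hypothetical helper: collect keyword arguments for diverse beam search."""
    if num_beams % num_beam_groups != 0:
        raise ValueError("num_beams must be divisible by num_beam_groups")
    if diversity_penalty <= 0.0:
        # With a penalty of 0.0 nothing pushes the groups apart.
        raise ValueError("diversity_penalty must be > 0 for the groups to differ")
    return {
        "num_beams": num_beams,
        "num_beam_groups": num_beam_groups,
        "num_return_sequences": num_beam_groups,  # one candidate per group
        "diversity_penalty": diversity_penalty,   # illustrative value, tune as needed
    }


kwargs = diverse_beam_kwargs()
# Assumed usage with the model and inputs from the original post:
# sample_outputs = self.model.generate(input_ids=input_ids, **kwargs)
```

Larger penalty values should push the groups further apart, usually at some cost in fluency, so it is worth trying a few values.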

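This argument controls how strongly each beam group is discouraged from picking tokens that earlier groups already chose at the same step; it is applied by a logits processor inside generate(). It defaults to 0.0, so without it nothing pushes the groups apart, which is why they all returned the same sentence.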
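
To make the idea concrete, here is a toy sketch of a Hamming-style diversity penalty in plain Python. It is a simplified illustration under my own assumptions, not the library's actual logits processor: the function name apply_hamming_penalty, the three-token vocabulary, and the scores are all made up for the example.

```python
from collections import Counter


def apply_hamming_penalty(scores, previous_group_tokens, diversity_penalty):
    """Lower the score of each token already picked by earlier groups at this step.

    scores: one float per vocabulary id, for a single beam in the current group.
    previous_group_tokens: token ids chosen at this step by beams in earlier groups.
    """
    counts = Counter(previous_group_tokens)
    return [
        score - diversity_penalty * counts.get(token_id, 0)
        for token_id, score in enumerate(scores)
    ]


scores = [-1.0, -1.2, -3.0]  # toy log-probabilities over a 3-token vocabulary
earlier = [0, 0]             # both beams of an earlier group picked token 0

no_penalty = apply_hamming_penalty(scores, earlier, 0.0)
with_penalty = apply_hamming_penalty(scores, earlier, 1.0)

# Index of the highest-scoring token in each case.
best_without = max(range(len(scores)), key=no_penalty.__getitem__)
best_with = max(range(len(scores)), key=with_penalty.__getitem__)
```

With a penalty of 0.0 the current group still prefers token 0, the same choice as the earlier group; with a penalty of 1.0 token 0 is pushed down and token 1 becomes the best choice, so the groups start to diverge.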

See this documentation.
