Can trainer.predict() return multiple generations for each sample?

berkayberabi · June 1, 2021, 2:21pm

I am looking for a feature similar to the num_return_sequences parameter of model.generate(), which sets how many generations are returned for each sample. It is especially useful when using beam search and analyzing the effect of beam search on the metrics.

Trainer.predict() does not seem to support this feature. Is there a workaround? I can use model.generate(), but it was very slow the last time I tried, because I had to write a for loop iterating over batches, whereas trainer.predict() handles the data loading automatically.
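For reference, here is a minimal sketch of the manual-loop approach mentioned above, not a confirmed Trainer feature. It assumes that model.generate() returns batch_size * num_return_sequences rows ordered by input sample, so the flat output can be cut back into one group per sample. The helper names (batched, generate_grouped, make_hf_generate_batch) are hypothetical and only for illustration; model and tokenizer are assumed to be an already loaded seq2seq model and its tokenizer.

```python
from typing import Callable, Iterator, List, Sequence


def batched(items: Sequence[str], batch_size: int) -> Iterator[Sequence[str]]:
    """Yield consecutive slices of `items` with at most `batch_size` elements."""
    for start in range(0, len(items), batch_size):
        yield items[start:start + batch_size]


def generate_grouped(
    samples: Sequence[str],
    generate_batch: Callable[[Sequence[str]], List[str]],
    num_return_sequences: int,
    batch_size: int = 8,
) -> List[List[str]]:
    """Run `generate_batch` over batches and regroup its flat output per sample.

    Assumption: for a batch of B inputs, `generate_batch` returns
    B * num_return_sequences strings, ordered sample by sample.
    """
    grouped: List[List[str]] = []
    for batch in batched(samples, batch_size):
        flat = generate_batch(batch)
        expected = len(batch) * num_return_sequences
        if len(flat) != expected:
            raise ValueError(f"expected {expected} generations, got {len(flat)}")
        for i in range(len(batch)):
            start = i * num_return_sequences
            grouped.append(list(flat[start:start + num_return_sequences]))
    return grouped


def make_hf_generate_batch(model, tokenizer, num_beams: int, num_return_sequences: int):
    """Hypothetical adapter around model.generate() for an already loaded model.

    With beam search, num_return_sequences must not exceed num_beams.
    """
    def generate_batch(texts: Sequence[str]) -> List[str]:
        # Tokenize the whole batch at once and move it to the model's device.
        inputs = tokenizer(
            list(texts), return_tensors="pt", padding=True, truncation=True
        ).to(model.device)
        output_ids = model.generate(
            **inputs,
            num_beams=num_beams,
            num_return_sequences=num_return_sequences,
        )
        return tokenizer.batch_decode(output_ids, skip_special_tokens=True)

    return generate_batch
```

Usage would look roughly like generate_grouped(texts, make_hf_generate_batch(model, tokenizer, num_beams=4, num_return_sequences=4), num_return_sequences=4). Batching the inputs this way avoids calling generate() once per sample, though it does not replace the distributed data loading that trainer.predict() provides.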

silvia-casola · November 25, 2021, 8:16am

Hi @berkayberabi! I have a similar problem. Did you manage to solve this issue?

berkayberabi · January 18, 2022, 9:39pm

Hi Silvia,

No, unfortunately, I could not solve it.
