Hey @Kylie,
good observation! len(out.scores) should indeed be max_length - 1. The reason is that the first token, the decoder_start_token_id, is not generated by the model, so no scores can be computed for it.
Here is an example:
#!/usr/bin/env python3
import torch
from transformers import AutoModelForSeq2SeqLM

model = AutoModelForSeq2SeqLM.from_pretrained("facebook/bart-large")

# max_length counts the decoder_start_token_id, so only 9 tokens are generated
out = model.generate(
    torch.tensor([10 * [1]]),
    return_dict_in_generate=True,
    output_scores=True,
    max_length=10,
)
print("len scores:", len(out.scores))  # gives 9, i.e. max_length - 1
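More generally, one score tensor is produced per generation step, so len(out.scores) always equals out.sequences.shape[-1] - 1 (the returned sequence additionally contains the decoder_start_token_id). A minimal sketch of that invariant, using a tiny randomly initialized BART so no model download is needed; the config values below are arbitrary small numbers chosen just for illustration:

```python
import torch
from transformers import BartConfig, BartForConditionalGeneration

# tiny, randomly initialized model -- sizes are arbitrary, for illustration only
config = BartConfig(
    vocab_size=32,
    d_model=16,
    encoder_layers=1,
    decoder_layers=1,
    encoder_attention_heads=2,
    decoder_attention_heads=2,
    encoder_ffn_dim=32,
    decoder_ffn_dim=32,
    max_position_embeddings=64,
)
model = BartForConditionalGeneration(config)

out = model.generate(
    torch.tensor([[1, 2, 3]]),
    return_dict_in_generate=True,
    output_scores=True,
    max_length=8,
)

# one score tensor per generated token; the decoder start token has none
assert len(out.scores) == out.sequences.shape[-1] - 1
```

The assertion also holds when generation stops early at an EOS token, since out.sequences is then shorter than max_length by the same amount.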
Would you be interested in correcting the documentation in a PR for Transformers?