T5 transformer tokens and scores

hle2000 · July 26, 2022, 12:13pm

Hello,

I am using beam search with the pretrained T5ForConditionalGeneration model. I am trying to implement some sort of uncertainty estimation at a token level. Therefore, I was looking at the ‘scores’ by setting return_dict_in_generate=True and output_scores=True.

output=self.model.module.model.generate(input_ids=input['input_ids'],
                                            attention_mask=input['attention_mask'], 
                                            num_beams=8,
                                            return_dict_in_generate=True,output_scores=True,
                                            output_hidden_states=True,output_attentions=True,
                                            early_stopping=True, max_length=200)

Now, output['scores'] returns a tuple as instructed by the docs. What does each tuple mean? Each tuple is a tensor of size tensor of shape (batch_size*num_beams*num_return_sequences, config.vocab_size). I can’t seem to visualize what each tuple represent. Any help would be greatly appreciated.

Topic		Replies	Views
Showing individual token and corresponding score during beam search Beginners	5	3647	November 28, 2023
Retrieving Probability Over Tokens During Beam Search 🤗Transformers	0	542	August 3, 2022
Generation scores Beginners	0	605	April 24, 2023
T5 for a multi-classification task with returning probabilities [0,1] 🤗Transformers	0	15	September 1, 2024
T5: why do we have more tokens expressed via cross attentions than the decoded sequence? Intermediate	1	385	February 21, 2023

T5 transformer tokens and scores

Related topics