What the tokens are cross attentions output for?

superlyc · February 4, 2022, 8:08pm

I am trying to output the cross attentions from a T5 model generate() function. The output type is BeamSearchEncoderDecoderOutput. According to the document, the cross attention has Tuple (one element for each generated token). I check the output ids which have 35 tokens, including and . But there are only 34 cross attentions output. Which one was left out?

Thanks

mrinalr · October 25, 2024, 3:39am

Generation uses start_token_id as the prompt for the decoder to start generation. I believe that initial token is not a part of cross_attentions output.

Topic		Replies	Views
T5: why do we have more tokens expressed via cross attentions than the decoded sequence? Intermediate	1	386	February 21, 2023
Problem with returning decoder cross attentions through generate function 🤗Transformers	0	25	October 25, 2024
Google T5 cross_attentions output Models	0	40	August 29, 2024
T5 cross-attention - inconsistent results Intermediate	1	1382	May 10, 2021
Code example of getting cross attention from T5? Intermediate	0	366	February 15, 2023

What the tokens are cross attentions output for?

Related topics