I have a question about `output_attentions`: I need to make a heatmap of the attention from the final layer of a BERT model, but I do not know whether `output_attentions[0]` is the first or the last layer. I tried to check the documentation, but I could not find an answer.
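For context, here is a minimal sketch of the plotting step I have in mind, with a random NumPy array standing in for one element of `outputs.attentions` (the shapes here are placeholders I chose for illustration, and indexing with `[-1]` assumes the tuple is ordered first layer to last, which is exactly what I'd like to confirm):

```python
import numpy as np
import matplotlib
matplotlib.use("Agg")  # render off-screen, no display needed
import matplotlib.pyplot as plt

# Dummy stand-in for outputs.attentions: a tuple with one tensor per
# encoder layer, each shaped (batch, num_heads, seq_len, seq_len).
num_layers, num_heads, seq_len = 12, 12, 8
rng = np.random.default_rng(0)
attentions = tuple(
    rng.random((1, num_heads, seq_len, seq_len)) for _ in range(num_layers)
)

# If the tuple runs first layer -> last layer, [-1] is the final layer.
final_layer = attentions[-1][0]           # (num_heads, seq_len, seq_len)
avg_attention = final_layer.mean(axis=0)  # average over heads -> (seq_len, seq_len)

plt.imshow(avg_attention, cmap="viridis")
plt.colorbar()
plt.savefig("attention_heatmap.png")
print(len(attentions), avg_attention.shape)
```

If someone can confirm the ordering, I can swap the dummy tuple for the real `outputs.attentions` from the model call.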