I have been trying to visualize attention maps of vision transformers. I was able to do this for ViT using the attention rollout method.
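For reference, the rollout I used for ViT looks roughly like this (a minimal sketch rather than my exact code; the checkpoint and image path are just placeholders):

```python
import torch
from PIL import Image
from transformers import ViTImageProcessor, ViTModel

# Placeholder checkpoint and image, not necessarily what I actually used.
processor = ViTImageProcessor.from_pretrained("google/vit-base-patch16-224")
model = ViTModel.from_pretrained("google/vit-base-patch16-224")

image = Image.open("example.jpg")
inputs = processor(images=image, return_tensors="pt")
outputs = model(**inputs, output_attentions=True)

# Attention rollout (Abnar & Zuidema, 2020): average over heads, add the
# identity for the residual connection, re-normalize, then multiply layer by layer.
rollout = None
for attn in outputs.attentions:          # each: (batch, heads, tokens, tokens)
    a = attn.mean(dim=1)                 # average over heads
    a = a + torch.eye(a.size(-1))        # account for the residual connection
    a = a / a.sum(dim=-1, keepdim=True)  # re-normalize rows
    rollout = a if rollout is None else a @ rollout

# rollout[:, 0, 1:] gives the CLS token's attention over the patch tokens,
# which can be reshaped to the patch grid and upsampled onto the image.
```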
However, when I tried to do the same for SwinV2, I observed that the attention tensors in the SwinV2-Large model had the following shapes:
```
torch.Size([16, 144, 144])
torch.Size([4, 144, 144])
torch.Size([1, 144, 144])
torch.Size([1, 36, 36])
```
Since the attention matrices have different sizes, they cannot simply be recursively multiplied to compute the rollout. I could do it for the first three attention states, which share the 144x144 shape, but I am not sure that is the right approach.
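To be concrete, a simplified version of what I tried on the first three attention states looks like the sketch below. Note that collapsing the leading window dimension by averaging is my own simplification here, and it is one of the things I suspect is wrong, since it ignores the window layout and the patch merging between stages:

```python
import torch

def partial_rollout(attentions):
    # attentions: the per-stage tensors with the shapes listed above,
    # e.g. attentions[0].shape == (16, 144, 144); the name is just for illustration.
    rollout = None
    for attn in attentions[:3]:              # only the stages with 144x144 matrices
        a = attn.mean(dim=0)                 # collapse the window dimension (naive)
        a = a + torch.eye(a.size(-1))        # residual connection, as in ViT rollout
        a = a / a.sum(dim=-1, keepdim=True)  # re-normalize rows
        rollout = a if rollout is None else a @ rollout
    return rollout                           # (144, 144), but its spatial meaning is unclear
```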
TLDR: I want to know how to visualize attention maps from SwinV2 transformers, given that the attention matrices do not all have the same shape. Is there a paper or a code repository I could refer to?
Thank you for your help.