How to visualize attention of a large encoder-decoder transformer model that isn't on Hugging Face?

Od-Lanir · June 28, 2021, 12:27pm

Hello, I am trying to visualize the attention weights of the model ‘Grover’ in its inference mode. In this mode, it produces a probability score for each input text. I have the model's checkpoints and config, but I am struggling to convert them into a form I can use to produce visualizations.

Any help would be really appreciated! I am also very happy to answer any follow-up questions to help clarify anything.

Thanks
