Visualize matrix inference for Roberta (Transformer)

Lyriccoder · August 30, 2022, 8:15am

Is there any framework that demonstrates each operation of the inference pipeline for RoBERTa (or any transformer or seq2seq model), with an explanation of the matrix sizes?

E.g., input: I am a student

1) Make one-hot encoding, matrix 256*1000m, matrix A
2) Create matrix B=A * (1000100)
…
n) Transpose Q = q_t
n+1) Transpose V = v_t
n+2) Transpose K = k_t
n+k) q_t * k_t
etc.
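The kind of step-by-step listing asked for above can be sketched as a shape trace that needs no model weights at all. This is only a sketch under stated assumptions: the default sizes (vocabulary 50265, hidden size 768, 12 heads, feed-forward size 3072) are taken to match the published roberta-base configuration, the function names `matmul_shape` and `trace_layer` are made up for illustration, and biases, layer norm, and residual additions are left out because they do not change any shape. Real implementations replace the one-hot multiplication with an embedding lookup, which gives the same result.

```python
def matmul_shape(a, b):
    """Return the shape of a @ b for two shapes of equal rank (2-D or batched 3-D)."""
    if a[-1] != b[-2]:
        raise ValueError(f"inner dimensions differ: {a} @ {b}")
    return a[:-1] + (b[-1],)


def trace_layer(seq_len, vocab=50265, hidden=768, heads=12, ffn=3072):
    """List (operation, output shape) pairs for the embedding plus one encoder layer."""
    head_dim = hidden // heads
    steps = []

    def record(name, shape):
        steps.append((name, shape))
        return shape

    # Embedding: one-hot rows select rows of the embedding matrix E.
    a = record("A = one-hot tokens", (seq_len, vocab))
    x = record("X = A @ E", matmul_shape(a, (vocab, hidden)))

    # Linear projections to queries, keys, and values.
    record("Q = X @ W_q", matmul_shape(x, (hidden, hidden)))
    record("K = X @ W_k", matmul_shape(x, (hidden, hidden)))
    record("V = X @ W_v", matmul_shape(x, (hidden, hidden)))

    # Split into heads, then scaled dot-product attention per head.
    q_h = record("Q split into heads", (heads, seq_len, head_dim))
    k_t = record("K^T per head", (heads, head_dim, seq_len))
    scores = record("scores = Q @ K^T / sqrt(head_dim)", matmul_shape(q_h, k_t))
    record("softmax(scores)", scores)
    record("context = softmax(scores) @ V",
           matmul_shape(scores, (heads, seq_len, head_dim)))

    # Merge heads, project, then the two feed-forward matmuls.
    merged = record("heads merged", (seq_len, hidden))
    out = record("merged @ W_o", matmul_shape(merged, (hidden, hidden)))
    up = record("feed-forward up-projection", matmul_shape(out, (hidden, ffn)))
    record("feed-forward down-projection", matmul_shape(up, (ffn, hidden)))
    return steps


if __name__ == "__main__":
    # seq_len=6 is a hypothetical token count, not the real tokenization of the example.
    for name, shape in trace_layer(seq_len=6):
        print(f"{name:40s} {shape}")
```

A full encoder repeats the layer part 12 times with the same shapes, so only `seq_len` changes the picture from one input to the next.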

I need to understand transformer inference exactly as it is performed, including the matrix sizes at each step.
I also need that visualization and explanation so I can implement a similar sequence of matrix operations (multiplication, etc.) and measure the speed of inference on random input. In effect, I would not use the transformers package; the inference procedure would be implemented manually as a sequence of matrix operations.
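A manual timing of this kind could look roughly like the sketch below, which assumes NumPy is acceptable even though the transformers package is not. It times only the multi-head self-attention block on random input and random weights; biases, layer norm, and the feed-forward block are omitted, and the names `self_attention` and `time_attention` are illustrative, not taken from any library.

```python
import time

import numpy as np


def softmax(x, axis=-1):
    """Numerically stable softmax along the given axis."""
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)


def self_attention(x, w_q, w_k, w_v, w_o, num_heads):
    """Multi-head self-attention without biases. x has shape (seq_len, hidden)."""
    seq_len, hidden = x.shape
    head_dim = hidden // num_heads

    def split(t):
        # (seq_len, hidden) -> (num_heads, seq_len, head_dim)
        return t.reshape(seq_len, num_heads, head_dim).transpose(1, 0, 2)

    q, k, v = split(x @ w_q), split(x @ w_k), split(x @ w_v)
    scores = q @ k.transpose(0, 2, 1) / head_dim ** 0.5  # (heads, seq, seq)
    context = softmax(scores) @ v                        # (heads, seq, head_dim)
    merged = context.transpose(1, 0, 2).reshape(seq_len, hidden)
    return merged @ w_o                                  # (seq_len, hidden)


def time_attention(seq_len=128, hidden=768, num_heads=12, repeats=10, seed=0):
    """Return (output shape, mean seconds per call) on random input and weights."""
    rng = np.random.default_rng(seed)
    x = rng.standard_normal((seq_len, hidden)).astype(np.float32)
    weights = [
        (rng.standard_normal((hidden, hidden)) * 0.02).astype(np.float32)
        for _ in range(4)
    ]
    out = self_attention(x, *weights, num_heads)  # warm-up call, not timed
    start = time.perf_counter()
    for _ in range(repeats):
        out = self_attention(x, *weights, num_heads)
    elapsed = (time.perf_counter() - start) / repeats
    return out.shape, elapsed


if __name__ == "__main__":
    shape, seconds = time_attention()
    print(f"output shape {shape}, {seconds * 1e3:.2f} ms per call")
```

The numbers depend heavily on the BLAS build behind NumPy and on `seq_len`, so they are best read as relative timings across configurations on one machine rather than as a prediction of what an optimized framework would achieve.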

samr · August 31, 2022, 9:57am

Are you aware of bertviz? From the docs:

The neuron view visualizes individual neurons in the query and key vectors and shows how they are used to compute attention.
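What such a per-neuron breakdown computes can be sketched in a few lines, assuming standard scaled dot-product attention. This is not bertviz's code; the function `neuron_view` and its dictionary keys are hypothetical names chosen for illustration. For one query vector it keeps the elementwise products with each key (one value per neuron), sums them into a score, and applies softmax over the keys.

```python
import math


def neuron_view(query, keys):
    """For one query vector, return per-key elementwise products, scores, and attention."""
    head_dim = len(query)
    rows = []
    for key in keys:
        # One product per neuron; their sum is the query-key dot product.
        products = [q * k for q, k in zip(query, key)]
        rows.append({
            "products": products,
            "score": sum(products) / math.sqrt(head_dim),
        })

    # Softmax over the keys, shifted by the max score for numerical stability.
    max_score = max(row["score"] for row in rows)
    exps = [math.exp(row["score"] - max_score) for row in rows]
    total = sum(exps)
    for row, e in zip(rows, exps):
        row["attention"] = e / total
    return rows


if __name__ == "__main__":
    # Toy 2-dimensional vectors, chosen only for illustration.
    for row in neuron_view([1.0, 0.0], [[1.0, 0.0], [0.0, 1.0]]):
        print(row)
```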

There is more detail in their Medium post here, and you can see a demo of the neuron view in the third plot in their Colab notebook.

The neuron view in the notebook looks like this:

However, the one in the blog post seems closer to what you are asking for:

I would also check out The Illustrated Transformer and The Annotated Transformer.
