Getting self-attention scores of GPT2LMHeadModel before the softmax

Hi! For a research project, I'm trying to get the alignment scores used to compute the attention weights before the softmax in GPT2LMHeadModel. Does anyone know if this is possible and, if so, how to do it?

I'm essentially trying to get the attention scores computed just before this part of the model: https://github.com/huggingface/transformers/blob/2a9b1f80c45cab19b542bc7cc004937d39d6f6fb/src/transformers/models/gpt2/modeling_gpt2.py
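
In case it helps clarify what I'm after, here's the kind of workaround I've been sketching (untested, and names like `raw_scores` and `make_hook` are my own, not library APIs): since `output_attentions=True` only returns the post-softmax weights, the idea is to register a forward hook on each block's `c_attn` projection and recompute `Q @ K^T / sqrt(head_dim)` manually.

```python
# Sketch: capture Q and K via a forward hook on each block's c_attn layer,
# then recompute the pre-softmax attention scores ourselves.
import math
import torch
from transformers import GPT2LMHeadModel, GPT2Tokenizer

model = GPT2LMHeadModel.from_pretrained("gpt2")
tokenizer = GPT2Tokenizer.from_pretrained("gpt2")
model.eval()

raw_scores = {}  # layer index -> (batch, heads, seq, seq) pre-softmax scores

def make_hook(layer_idx, num_heads):
    def hook(module, inputs, output):
        # c_attn projects hidden states to concatenated [query, key, value]
        query, key, _ = output.split(output.shape[-1] // 3, dim=-1)
        batch, seq, embed = query.shape
        head_dim = embed // num_heads
        # reshape to (batch, heads, seq, head_dim)
        q = query.view(batch, seq, num_heads, head_dim).transpose(1, 2)
        k = key.view(batch, seq, num_heads, head_dim).transpose(1, 2)
        # alignment scores before the causal mask and softmax
        raw_scores[layer_idx] = q @ k.transpose(-1, -2) / math.sqrt(head_dim)
    return hook

handles = [
    block.attn.c_attn.register_forward_hook(make_hook(i, model.config.n_head))
    for i, block in enumerate(model.transformer.h)
]

inputs = tokenizer("Hello, world!", return_tensors="pt")
with torch.no_grad():
    model(**inputs)

for h in handles:
    h.remove()

print(raw_scores[0].shape)  # e.g. torch.Size([1, 12, 4, 4])
```

I'm not sure this matches the exact scaling/masking order used internally, though, which is why I'd like to confirm the right way to do this.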

I would appreciate any help!