I’m using this code snippet from the docs of the HuggingFace ViT image-classification model, with one addition: I pass the output_attentions=True parameter. Nevertheless, no attentions are returned.
from transformers import ViTFeatureExtractor, ViTForImageClassification
from PIL import Image
import requests
url = 'http://images.cocodataset.org/val2017/000000039769.jpg'
image = Image.open(requests.get(url, stream=True).raw)
feature_extractor = ViTFeatureExtractor.from_pretrained('google/vit-base-patch16-224')
model = ViTForImageClassification.from_pretrained('google/vit-base-patch16-224', output_attentions=True)
inputs = feature_extractor(images=image, return_tensors="pt")
outputs = model(**inputs)
logits = outputs.logits
# --> this should print the attentions
print(outputs.attentions)
# model predicts one of the 1000 ImageNet classes
predicted_class_idx = logits.argmax(-1).item()
print("Predicted class:", model.config.id2label[predicted_class_idx])
The output of print(outputs.attentions) is:
(None, None, None, None, None, None, None, None, None, None, None, None)
What am I doing wrong, and how can I get the attention values?
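For reference, here is what I expect, plus a variant I have been considering (a minimal sketch; I am assuming the forward call also accepts output_attentions=True, as it does for other transformers models, but I have not confirmed this for ViT):

# Hypothetical variant: request attentions at call time instead of
# setting the flag on the config via from_pretrained.
outputs = model(**inputs, output_attentions=True)

# Expectation: one attention tensor per transformer layer.
# For google/vit-base-patch16-224 that should be 12 tensors of shape
# (batch_size=1, num_heads=12, seq_len=197, seq_len=197),
# where 197 = 196 image patches + 1 [CLS] token.
for i, attn in enumerate(outputs.attentions):
    print(i, None if attn is None else attn.shape)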