Hi. I’m trying to extract the various attention masks from the output of the Qwen/Qwen2-VL-7B-Instruct model. Here is an overview of what I’m doing:
```python
self.model_id = "Qwen/Qwen2-VL-7B-Instruct"
self.base_model = Qwen2VLForConditionalGeneration.from_pretrained(
    self.model_id,
    torch_dtype=torch.float16,
    device_map="auto",
    low_cpu_mem_usage=True,
    cache_dir=self.cache_dir,
)
self.processor = AutoProcessor.from_pretrained(self.model_id)

raw_input = self.processor(
    images=images,
    text=prompts,
    return_tensors="pt",
    padding=True,
).to(0, torch.float16)

outputs = []
raw_outputs = self.base_model.generate(**raw_input, max_new_tokens=200)
for raw_output in raw_outputs:
    outputs.append(self.processor.decode(raw_output, skip_special_tokens=True))
return outputs
```
How can I alter this to provide attention masks? Thanks.
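For context, I believe `generate` can return per-layer attention weights when called with `output_attentions=True` and `return_dict_in_generate=True` (and the padding mask itself is already in `raw_input["attention_mask"]` from the processor). Below is a minimal sketch of that pattern on a tiny stand-in text model (`hf-internal-testing/tiny-random-gpt2`, used here only so the example is quick to run; it is not Qwen2-VL) — I'm unsure whether the same keywords behave identically for `Qwen2VLForConditionalGeneration.generate`, which is essentially my question:

```python
# Sketch: asking generate() for attention weights.
# Uses a tiny stand-in model so the example runs quickly; I'm assuming
# the same kwargs carry over to Qwen2VLForConditionalGeneration.generate.
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "hf-internal-testing/tiny-random-gpt2"  # stand-in, not Qwen2-VL
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id)

inputs = tokenizer("Hello there", return_tensors="pt")

gen = model.generate(
    **inputs,
    max_new_tokens=3,
    do_sample=False,
    return_dict_in_generate=True,  # return a structured output, not a bare tensor
    output_attentions=True,        # include per-step, per-layer attention weights
)

# gen.sequences plays the role of raw_outputs above for decoding:
text = tokenizer.decode(gen.sequences[0], skip_special_tokens=True)

# gen.attentions: one tuple per generated token; each entry is a tuple of
# per-layer tensors of shape (batch, num_heads, query_len, key_len).
print(len(gen.attentions))         # number of generation steps
print(gen.attentions[0][0].shape)  # first step, first layer
```

With `return_dict_in_generate=True` the decode loop would iterate over `gen.sequences` instead of the raw tensor, but otherwise the surrounding code shouldn't need to change.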