Specify different attention masks for different layers

Hi, I am experimenting with some ideas that involve applying different attention masks to different transformer layers. Currently, I am trying to implement a custom model whose forward pass accepts two attention masks, one per group of layers (see the sketch below). But I'm wondering if there is a simpler way to do this. Any suggestions are welcome! Thanks!
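
In case a concrete example helps, here is a minimal sketch of what I mean (plain PyTorch; `DualMaskEncoder` and the half-way switch point are just placeholders I made up for illustration):

```python
import torch
import torch.nn as nn


class DualMaskEncoder(nn.Module):
    """Encoder that applies mask_a to the first half of its layers
    and mask_b to the remaining layers."""

    def __init__(self, d_model=256, nhead=4, num_layers=6):
        super().__init__()
        self.layers = nn.ModuleList(
            nn.TransformerEncoderLayer(d_model, nhead, batch_first=True)
            for _ in range(num_layers)
        )
        self.switch = num_layers // 2  # layer index where the mask changes

    def forward(self, x, mask_a, mask_b):
        # mask_a / mask_b: (seq_len, seq_len) masks in the format expected
        # by nn.TransformerEncoderLayer's src_mask argument.
        for i, layer in enumerate(self.layers):
            mask = mask_a if i < self.switch else mask_b
            x = layer(x, src_mask=mask)
        return x


if __name__ == "__main__":
    model = DualMaskEncoder()
    x = torch.randn(2, 10, 256)  # (batch, seq, d_model)
    # Causal (additive) mask for the first half, no masking for the second.
    causal = torch.triu(torch.full((10, 10), float("-inf")), diagonal=1)
    full = torch.zeros(10, 10)
    out = model(x, mask_a=causal, mask_b=full)
    print(out.shape)  # torch.Size([2, 10, 256])
```

This works, but it means re-implementing the whole encoder stack by hand, which is why I'm asking whether there is a less invasive approach.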