I’m a beginner in NLP, and I’m using T5 for my experiments. My goal is to modify the mask of the T5 decoder; as the official docs say, T5 uses a causal mask for the decoder by default.
https://github.com/huggingface/transformers/blob/main/src/transformers/modeling_utils.py
I wonder how I can modify this default mask and define it myself.
Hey @yoohell, you can create your own decoder_attention_mask and pass that to T5 instead of using the default.
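For example, something like this (a minimal sketch; the checkpoint and example strings are just placeholders, and the decoder inputs are only illustrative):

```python
from transformers import T5ForConditionalGeneration, T5Tokenizer

tokenizer = T5Tokenizer.from_pretrained("t5-small")
model = T5ForConditionalGeneration.from_pretrained("t5-small")

enc = tokenizer("translate English to German: Hello there", return_tensors="pt")
dec = tokenizer("Hallo", return_tensors="pt")

# 2D mask of shape (batch_size, target_sequence_length): 1 = attend, 0 = ignore.
# This replaces the default pad-token mask; note that with a 2D mask the causal
# mask is still applied on top of it.
decoder_attention_mask = dec.attention_mask

outputs = model(
    input_ids=enc.input_ids,
    attention_mask=enc.attention_mask,
    decoder_input_ids=dec.input_ids,
    decoder_attention_mask=decoder_attention_mask,
)
```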
Do you mean the decoder_attention_mask argument of the forward() function?
I found it in the T5 docs, which say:

decoder_attention_mask (torch.BoolTensor of shape (batch_size, target_sequence_length), optional) — Default behavior: generate a tensor that ignores pad tokens in decoder_input_ids. Causal mask will also be used by default.
The causal mask matrix itself is built by a built-in function; is there any way I can replace it with my own custom matrix?
Thanks for the reply!
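In the versions of transformers I’ve looked at, get_extended_attention_mask (in the modeling_utils.py file linked above) treats a 3D decoder_attention_mask of shape (batch_size, target_length, target_length) as the full attention pattern and does not fold the built-in causal mask on top of it. If that holds for your installed version, one way to replace the causal matrix is to pass your own 3D mask. A minimal sketch, with an arbitrary example pattern; do double-check this behavior on your version:

```python
import torch
from transformers import T5ForConditionalGeneration, T5Tokenizer

tokenizer = T5Tokenizer.from_pretrained("t5-small")
model = T5ForConditionalGeneration.from_pretrained("t5-small")

enc = tokenizer("translate English to German: Hello there", return_tensors="pt")
dec = tokenizer("Hallo", return_tensors="pt")

batch_size, tgt_len = dec.input_ids.shape

# 3D mask of shape (batch_size, tgt_len, tgt_len): row i holds the positions
# that decoder position i may attend to (1 = attend, 0 = ignore).
custom_mask = torch.ones(batch_size, tgt_len, tgt_len, dtype=torch.long)
custom_mask[:, 0, 1:] = 0  # example only: position 0 attends just to itself

outputs = model(
    input_ids=enc.input_ids,
    attention_mask=enc.attention_mask,
    decoder_input_ids=dec.input_ids,
    decoder_attention_mask=custom_mask,  # used as-is, no causal mask added
)
```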
Can I pass in a mask of all 1s to turn off the causal mask?
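Note that a plain 2D all-ones mask will not turn the causal mask off: for a 2D mask, the decoder still combines it with the causal triangle (at least in the versions I checked). You can verify this on your install with something like the sketch below, assuming get_extended_attention_mask is still the helper your version uses:

```python
import torch
from transformers import T5ForConditionalGeneration

model = T5ForConditionalGeneration.from_pretrained("t5-small")
decoder = model.get_decoder()  # the T5Stack with is_decoder=True

ones_2d = torch.ones(1, 4, dtype=torch.long)  # "no padding", but still 2D
ext = decoder.get_extended_attention_mask(ones_2d, (1, 4), ones_2d.device)

# Masked positions hold a large negative value; the upper triangle (future
# positions) stays blocked even though the input mask was all ones.
print(ext.shape)        # torch.Size([1, 1, 4, 4])
print((ext < 0).any())  # tensor(True)
```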