@nielsr - I am trying to understand the cross-attention (X-attn) code in the VisionEncoderDecoder architecture. I can see the comment in the code: "Cross-attention layers are automatically added to the decoder and should be fine-tuned on a downstream generative task, like image captioning."
What is the best place to see the actual implementation of the cross-attention between the vision encoder and the language decoder? To show what I mean, I've included a small inspection sketch below.
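This is roughly how I've been poking around so far (a minimal sketch, assuming a ViT encoder paired with a GPT-2 decoder purely as an example; the checkpoint names are just for illustration):

```python
from transformers import VisionEncoderDecoderModel

# Example pairing for illustration only: ViT encoder + GPT-2 decoder
model = VisionEncoderDecoderModel.from_encoder_decoder_pretrained(
    "google/vit-base-patch16-224-in21k", "gpt2"
)

# The decoder config should now have cross-attention enabled
print(model.decoder.config.add_cross_attention)  # expect True

# List decoder submodules whose names mention cross-attention
for name, module in model.decoder.named_modules():
    if "crossattention" in name.lower():
        print(name, type(module).__name__)
```

This lists the cross-attention modules that get added to each decoder block, but I'd still like a pointer to where in the source the encoder hidden states are actually fed into them.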
Please advise.
Thanks,
Prithivi