In Donut Where the output of swin diffused with the text->1.At the starting of Bart encoder,2. cross attention(K,V from swin,Q from attention) of second attention of Bart encoder,3.directly the decoder part of BART

shubham05 · August 2, 2023, 8:28am

is it the same architecture AS follows

is it trained or test in same manner as follows

Topic		Replies	Views
Finetune Donut with new tokenizer Intermediate	6	2443	October 10, 2023
BartDecoder outputs perfect predictions even when untrained Beginners	0	148	October 27, 2023
Bart summarization Beginners	3	1636	August 10, 2020
Funetune BART for text auto-encoder Models	0	450	November 22, 2022
Using BART models encoder and decoder Models	1	626	November 22, 2022