About the encoder and generator used in the RAG model

zuujhyt · December 25, 2020, 9:25am

Hi, I have some questions about the RAG model.
In the RAG paper, the query encoder is DPR and the generator is BART.

My questions are:

1. Is the generator a full BART model, or just the decoder part of BART?
2. If I implement RAG with the encoder part of BART as the query encoder and the decoder part of BART as the generator, does that make sense with respect to the RAG concept? That seems more intuitive to me. Why do they use a "heterogeneous" setting?

Thanks.

Jung · December 25, 2020, 2:15pm

Hi,

1. The generator is the full BART encoder-decoder. If you have a RAG model, you can access it via model.generator.
2. RAG's question encoder is not the same as the encoder inside RAG's generator. This can be confusing, so let me try to explain:
- The question encoder encodes the "question" so that the retriever can fetch "documents" (also called "contexts").
- The retriever then concatenates the "contexts" with the "question"; this concatenated text is the new input.
- This new input is encoded by BART's encoder, and BART's decoder generates the answer from it.
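
To make the data flow above concrete, here is a minimal toy sketch in plain Python. It does not call the Transformers library: `embed`, `retrieve`, and `build_generator_inputs` are hypothetical stand-ins (a bag-of-words counter instead of DPR, a three-sentence list instead of a real index), and the `" // "` separator is only an illustrative assumption. The point is just that the question encoder is used for retrieval, while the generator receives the concatenated context plus question as a fresh input.

```python
from collections import Counter

# Toy corpus standing in for the retriever's document index (made-up data).
DOCS = [
    "BART is a sequence-to-sequence model with an encoder and a decoder.",
    "DPR encodes questions and passages into dense vectors for retrieval.",
    "Paris is the capital of France.",
]

def embed(text):
    """Toy stand-in for a dense encoder such as DPR: a bag-of-words count vector."""
    cleaned = text.lower().replace("?", "").replace(".", "")
    return Counter(cleaned.split())

def score(q_vec, d_vec):
    """Inner product between two sparse count vectors."""
    return sum(q_vec[w] * d_vec[w] for w in q_vec)

def retrieve(question, docs, k=1):
    """Step 1: encode the question and return the k best-matching documents."""
    q_vec = embed(question)
    ranked = sorted(docs, key=lambda d: score(q_vec, embed(d)), reverse=True)
    return ranked[:k]

def build_generator_inputs(question, contexts, sep=" // "):
    """Step 2: concatenate each retrieved context with the question.

    The separator is an assumption made for this sketch.
    """
    return [ctx + sep + question for ctx in contexts]

question = "What is the capital of France?"
contexts = retrieve(question, DOCS, k=1)
generator_inputs = build_generator_inputs(question, contexts)

# Step 3 (not implemented here): each string in generator_inputs would be
# encoded by the generator's own encoder and decoded into an answer.
print(generator_inputs[0])
```

In this sketch the question's embedding never reaches the generator; only the concatenated text does, which is why the generator needs its own encoder.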

Hope this helps!

zuujhyt · December 25, 2020, 6:08pm

Hi, thanks for the reply! I understand it better now.
