RAG Retriever: hf vs legacy vs exact vs compressed

Hi Quentin!

We saw your post "RAG Retriever: Exact vs. Compressed Index?", which says the exact index has to be used to replicate the paper's results.

However, the HuggingFace documentation (RAG — transformers 4.11.2 documentation) says that the legacy index is the one that replicates the paper's results.

Also, if the compressed index uses the same wiki dump, and differs mainly in that the FAISS index is loaded into RAM, why can it not be used to replicate the paper's results?
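(For context on our confusion: our understanding is that a compressed index trades some accuracy for memory, so its nearest neighbors can differ slightly from an exact search even over the same vectors. A toy NumPy sketch of that idea, not the actual FAISS/RAG code, with made-up data:)

```python
import numpy as np

rng = np.random.default_rng(0)
docs = rng.standard_normal((1000, 64)).astype(np.float32)   # fake passage vectors
query = rng.standard_normal(64).astype(np.float32)          # fake question vector

# Exact search: brute-force inner product over all document vectors.
exact_top = int(np.argmax(docs @ query))

# "Compressed" search (toy stand-in for quantization): round the vectors
# to one decimal place before scoring, losing some precision.
quantized = np.round(docs, 1)
approx_top = int(np.argmax(quantized @ query))

# The two top hits usually agree, but are not guaranteed to,
# so retrieved passages (and downstream answers) can drift.
print(exact_top, approx_top)
```

Is this intuition correct, i.e. is lossy quantization the reason the compressed index cannot be used for exact replication?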

When we tried to load the pretrained RAG retriever with the legacy index, we hit "MemoryError: std::bad_alloc".

Kindly help us resolve this confusion, as we are trying to load the RAG retriever's wiki dump for text-based question answering.

Also, the wiki dump seems to be huge (140 GB), and we do not have that much storage available right now.

Is there a smaller text-based wiki dump that would still let us replicate the paper's results?

Looking forward to your guidance. Many thanks for your time!

Regards