RAG Retriever: hf vs legacy vs exact vs compressed

In the eval_rag.py file under examples, I notice that the choices for the index_name argument are “hf” and “legacy”. How are these different from “exact” and “compressed”?

Hi! That question was answered here: RAG Retriever: Exact vs. Compressed Index? - #4 by lhoestq

Hi Jung! I did see that post; that’s what created the confusion for me. The example uses “hf” vs “legacy”, but that post talks about “exact” vs “compressed”. It would be great if this could be clarified.

Hi @sashank06, sorry I misunderstood the question.
As far as I understand from the source code, the class LegacyIndex refers to the original index used in the RAG/DPR papers, while the class HFIndexBase allows us to use custom datasets (there’s a dataset argument).

Therefore, according to the above quote, I understand that “exact” and “compressed” are subtypes of “legacy”. I may have misunderstood, and it would be great if @lhoestq could help clarify here :smiley:
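
For illustration, here is a hedged sketch of the custom-dataset path that HFIndexBase enables, modeled on the use_own_knowledge_dataset.py example in the transformers repo. The toy passages and random 768-d embeddings are placeholders; real retrieval needs embeddings produced by a DPRContextEncoder:

```python
import numpy as np
from datasets import Dataset
from transformers import RagRetriever

# Toy passage collection with the columns RAG expects: "title", "text"
# and "embeddings". The random vectors stand in for real DPR context
# embeddings (768-dimensional for the standard DPR encoder).
passages = Dataset.from_dict({
    "title": ["Aaron", "Zeus"],
    "text": ["Aaron is a prophet...", "Zeus is a Greek god..."],
    "embeddings": [np.random.rand(768).astype("float32").tolist() for _ in range(2)],
})
passages.add_faiss_index(column="embeddings")  # build a FAISS index over the embeddings

# Plug the indexed dataset into the retriever via the custom-index path.
retriever = RagRetriever.from_pretrained(
    "facebook/rag-sequence-nq",
    index_name="custom",
    indexed_dataset=passages,
)
```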

Hi! That’s a mistake in the eval_rag.py parameter choices. As specified in the RAG configuration (see the documentation), one can choose between ‘legacy’, ‘exact’ and ‘compressed’. The legacy index is the original index used for RAG/DPR, while the other two use the indexing implementation of the datasets library.

link to the PR that fixes the eval_rag.py parameter description: https://github.com/huggingface/transformers/pull/8730
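
For reference, a minimal sketch of selecting one of these index types through RagRetriever; use_dummy_dataset=True swaps in a tiny test index so you can experiment without downloading the full wiki_dpr dataset:

```python
from transformers import RagRetriever

# index_name is one of "legacy", "exact" or "compressed".
retriever = RagRetriever.from_pretrained(
    "facebook/rag-sequence-nq",
    index_name="exact",
    use_dummy_dataset=True,  # remove this to download the full index
)
```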

Hi Quentin!

We saw your post (RAG Retriever: Exact vs. Compressed Index?) saying that the exact index has to be used to replicate the paper’s results.

However, the Hugging Face documentation (RAG — transformers 4.11.2 documentation) says that the legacy index replicates the paper’s results.

Also, if the compressed index uses the same wiki dump and its FAISS index is loaded in RAM, why can’t it be used to replicate the paper’s results?

While using legacy to load the pretrained RAG retriever, we ran into “MemoryError: std::bad_alloc”.

Could you kindly help us resolve this confusion? We are trying to load the RAG retriever’s wiki dump for text-based question answering.

Also, the wiki dump seems to be huge (140 GB), and we don’t have that much storage available right now.

Is there a text-based wiki dump that can replicate the paper’s results while being smaller than 140 GB?

Looking forward to your guidance and help. Many thanks for your time!

Regards

> We saw your post (RAG Retriever: Exact vs. Compressed Index?) saying that the exact index has to be used to replicate the paper’s results.
> However, the Hugging Face documentation (RAG — transformers 4.11.2 documentation) says that the legacy index replicates the paper’s results.

The compressed index has lower retrieval performance than the exact one. That’s why you need to use either the exact or the legacy one to replicate RAG’s performance.

> While using legacy to load the pretrained RAG retriever, we ran into “MemoryError: std::bad_alloc”.

The legacy index is 35 GB and needs to fit entirely in RAM, so make sure you have enough memory when using it.
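
As an illustration, here is a minimal sketch of checking available memory before loading the legacy index; it assumes psutil is installed, and uses the 35 GB figure above as the threshold:

```python
import psutil
from transformers import RagRetriever

# The legacy FAISS index is loaded entirely into memory (~35 GB),
# so fail early with a clear message if there isn't enough RAM.
available_gb = psutil.virtual_memory().available / 1024**3
if available_gb < 35:
    raise MemoryError(
        f"Only {available_gb:.1f} GB of RAM available; the legacy index needs ~35 GB."
    )

retriever = RagRetriever.from_pretrained(
    "facebook/rag-sequence-nq", index_name="legacy"
)
```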

> Is there a text-based wiki dump that can replicate the paper’s results while being smaller than 140 GB?

You can use the wiki dump from the legacy index.
I think this one takes less disk space because its embeddings are stored quantized inside the FAISS index, whereas the other indexes store the plain embeddings (around 70 GB to download, plus another 70 GB to convert to an Arrow dataset file). So you would need around 10 GB for the text plus the FAISS index (35 GB).

Thanks a lot for your response, that’s extremely helpful! May I also ask about the RAM requirements for the ‘exact’ and ‘compressed’ indexes? Much appreciated.

It takes 35 GB for the exact one and 3 GB for the compressed one :slight_smile:
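
So for a low-RAM setup, a sketch along these lines would load the compressed index, at some cost in retrieval accuracy as noted above:

```python
from transformers import RagRetriever

# The compressed index stores quantized embeddings, so it only needs ~3 GB
# of RAM (vs. ~35 GB for "exact"/"legacy"), with lower retrieval performance.
retriever = RagRetriever.from_pretrained(
    "facebook/rag-sequence-nq",
    index_name="compressed",
)
```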