I have a question about RAG. When I query a vector database as part of an LLM pipeline, how exhaustive is the search?
For example, if I’m searching a corpus of legal documents for outcomes in specific case scenarios.
Is the answer I get back based on an exhaustive search of all indexed documents that contain relevant matches?
Does it just pull the top X matches based on similarity?
And do all of those matches make it back into the LLM's context to inform the answer? Or is there some kind of cut-off, and if so, what?
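To make the question concrete, here is a minimal sketch of what I understand typical retrieval to look like: every chunk gets a similarity score against the query, and only the top-k chunks above some threshold reach the LLM. (The function, the `k=4` default, and the `min_score` cutoff are my own illustrative choices, not any particular library's API; real vector stores also tend to use approximate rather than exhaustive search.)

```python
import math

def top_k_matches(query_vec, doc_vecs, k=4, min_score=0.2):
    """Rank stored chunks by cosine similarity and keep the best k.

    Hand-rolled sketch: production vector stores usually use approximate
    nearest-neighbour indexes, so the scan is often NOT exhaustive, but
    the effect is the same -- only the top-k chunks ever reach the LLM's
    context window.
    """
    def cos(a, b):
        dot = sum(x * y for x, y in zip(a, b))
        return dot / (math.hypot(*a) * math.hypot(*b))

    # Score every chunk, sort descending, keep k, then apply the cutoff.
    scored = sorted(
        ((i, cos(query_vec, v)) for i, v in enumerate(doc_vecs)),
        key=lambda p: p[1],
        reverse=True,
    )
    return [(i, s) for i, s in scored[:k] if s >= min_score]

# Toy 3-dimensional "embeddings" (real ones have hundreds of dimensions).
docs = [[1.0, 0.0, 0.0], [0.9, 0.1, 0.0], [0.0, 1.0, 0.0]]
query = [1.0, 0.05, 0.0]
print(top_k_matches(query, docs, k=2))
```

So my question is essentially: is this the right mental model, and what happens to everything below the cutoff?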
The broader question is: how well informed will a RAG answer be, and which use cases is it good or bad for? E.g. will the answer draw on all the stored information, or just a subset of it?