Determining When to Search or Refine Answers in a RAG System Using Previous Context

riddhi810 · November 18, 2024, 12:55pm

When working with a RAG system that uses previous context for answers, how does it determine whether to search the vector embeddings again or simply refine an answer from the previous conversation?

For example, in a RAG system that answers from uploaded context:

The first time, when asking for a list of leaves, it searches the vector database and provides the answer.
The second time, when asking to display the same list of leaves in a tabular format, it should not perform another search, but rather refine the previous answer.

How can the system distinguish between when a new search is needed versus when it should reuse and refine previous responses?

Topic		Replies	Views
Seeking Advice on Processing Support Conversations for Efficient RAG Model Search Intermediate	0	50	September 9, 2024
Vector DB - Exhaustive search in RAG Intermediate	0	321	November 14, 2023
RAG Model for QA Models	1	1336	December 30, 2023
Fine-Tuning + RAG based Chatbot: Dataset Structure & Instruction Adherence Issues Intermediate	7	367	March 11, 2025
In RAG systems, who's really responsible for hallucination... the model, the retriever, or the data? Models	3	67	June 27, 2025

Determining When to Search or Refine Answers in a RAG System Using Previous Context

Related topics