Definition
Reflective RAG is an architectural pattern that adds self-correction loops to retrieval-augmented generation (RAG): an LLM critiques its retrieved context for relevance and its generated response for factual consistency with that context. The loop aims to reduce hallucinations by discarding 'noisy' documents and re-triggering retrieval or generation when quality thresholds are not met, trading higher latency and token cost for higher-precision output.
It differs from 'Naive RAG', which retrieves once and generates once, in that it iteratively evaluates and corrects its own errors before presenting a final answer.
Visual analog: a research student who writes a draft, compares it against their source notes to find contradictions, and then rewrites the draft or returns to the library if information is missing.
- Self-RAG (Specific Framework)
- Corrective RAG (CRAG) (Component/Alternative Strategy)
- Hallucination Grading (Prerequisite)
- Agentic Workflow (Architectural Category)
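The self-correction loop described in the definition can be sketched roughly as follows. This is a minimal illustration, not a reference implementation of Self-RAG or CRAG. Every name in it is hypothetical (`reflective_rag`, `toy_retrieve`, `overlap_relevance`, `toy_generate`, `overlap_grounding`, the thresholds, and `max_attempts`), and the word-overlap graders are simple stand-ins for what would normally be LLM-based relevance and hallucination graders.

```python
import re
from dataclasses import dataclass
from typing import Callable, List


@dataclass
class ReflectiveResult:
    answer: str
    kept_docs: List[str]
    attempts: int
    grounded: bool  # True if the answer passed the grounding check


def reflective_rag(
    query: str,
    retrieve: Callable[[str, int], List[str]],
    grade_relevance: Callable[[str, str], float],
    generate: Callable[[str, List[str]], str],
    grade_grounding: Callable[[str, List[str]], float],
    relevance_threshold: float = 0.5,   # assumed value, tune per application
    grounding_threshold: float = 0.5,   # assumed value, tune per application
    max_attempts: int = 3,              # caps the extra latency and token cost
) -> ReflectiveResult:
    answer, kept = "", []
    for attempt in range(1, max_attempts + 1):
        docs = retrieve(query, attempt)
        # Reflection step 1: discard 'noisy' documents.
        relevant = [d for d in docs
                    if grade_relevance(query, d) >= relevance_threshold]
        if not relevant:
            continue  # nothing usable, so re-trigger retrieval
        answer, kept = generate(query, relevant), relevant
        # Reflection step 2: check the answer against the kept context.
        if grade_grounding(answer, kept) >= grounding_threshold:
            return ReflectiveResult(answer, kept, attempt, True)
    # Attempt budget exhausted: return the last draft, flagged as ungrounded.
    return ReflectiveResult(answer, kept, max_attempts, False)


# --- Toy stand-ins (hypothetical; real systems would call an LLM) ---------

def _words(text: str) -> set:
    return set(re.findall(r"[a-z0-9]+", text.lower()))


def overlap_relevance(query: str, doc: str) -> float:
    q = _words(query)
    return len(q & _words(doc)) / len(q) if q else 0.0


def overlap_grounding(answer: str, docs: List[str]) -> float:
    a = _words(answer)
    support = set().union(*(_words(d) for d in docs))
    return len(a & support) / len(a) if a else 0.0


TOY_CORPUS = {
    1: ["Bananas are rich in potassium."],                       # noisy first pass
    2: ["The Eiffel Tower is in Paris.", "Bananas are yellow."],
}


def toy_retrieve(query: str, attempt: int) -> List[str]:
    return TOY_CORPUS.get(attempt, [])


def toy_generate(query: str, docs: List[str]) -> str:
    return docs[0]  # echoes the top document instead of calling a model


result = reflective_rag(
    "Where is the Eiffel Tower?",
    toy_retrieve, overlap_relevance, toy_generate, overlap_grounding,
)
print(result)
```

In this toy run the first retrieval returns only an irrelevant document, so the loop retrieves again and accepts the answer on the second attempt. The `max_attempts` cap is one way to bound the latency and token cost that the pattern trades for precision.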