Definition
AUC (Area Under the Curve) measures the aggregate performance of a binary classifier or retrieval model across all possible classification thresholds, specifically representing the probability that a randomly chosen relevant document is ranked higher than a randomly chosen irrelevant one. In RAG pipelines, it evaluates the robustness of the retrieval stage or the agent's intent classification before decision-making.
In AI, it refers to the performance of ranking and classification models, not geometric area or calculus integrals.
"A metal detector's sensitivity dial; AUC measures the device's inherent ability to distinguish gold from scrap metal, regardless of how high or low you set the 'alert' threshold."
Conceptual Overview
AUC (Area Under the Curve) measures the aggregate performance of a binary classifier or retrieval model across all possible classification thresholds, specifically representing the probability that a randomly chosen relevant document is ranked higher than a randomly chosen irrelevant one. In RAG pipelines, it evaluates the robustness of the retrieval stage or the agent's intent classification before decision-making.
Disambiguation
In AI, it refers to the performance of ranking and classification models, not geometric area or calculus integrals.
Visual Analog
A metal detector's sensitivity dial; AUC measures the device's inherent ability to distinguish gold from scrap metal, regardless of how high or low you set the 'alert' threshold.