Glossary

A comprehensive, searchable index of Retrieval-Augmented Generation terms, definitions, and synonyms to support technical clarity and consistency.

Advanced Concepts

Semantic graphs

Advanced Concepts

Understanding cause-effect

Advanced Concepts

Finding relevant code snippets

Advanced Concepts

"What-if" analysis

Advanced Concepts

Transferring knowledge to smaller models

Advanced Concepts

Combining multiple models

Advanced Concepts

Merging embeddings/outputs

Advanced Concepts

Learning from rankings

Advanced Concepts

Virtual environment testing

Advanced Concepts

Non-text data retrieval

Advanced Concepts

Creating training examples

Advanced Concepts

Using tests as metrics

Advanced Embedding Techniques

Advanced Embedding Techniques

Vectors capturing surrounding context

Advanced Embedding Techniques

Unified space for multiple languages

Advanced Embedding Techniques

Full-dimensional continuous vectors

Advanced Embedding Techniques

Reducing vector dimensions while preserving relationships

Advanced Embedding Techniques

Models trained on specialized corpora (medical, legal)

Advanced Embedding Techniques

Number of dimensions in vector (e.g., 768, 1536)

Advanced Embedding Techniques

Models fine-tuned for specific retrieval tasks

Advanced Embedding Techniques

Multiple representations per document (title, content, etc.)

Advanced Embedding Techniques

Cross-language vector representations

Advanced Embedding Techniques

Compressing vectors by 97% through subvector coding

Advanced Embedding Techniques

Reducing precision of vectors (4-bit, 8-bit)

Advanced Embedding Techniques

/ **Sparse Vectors** - High-dimensional vectors with mostly zeros

SPLADE

SPLADESPLADE
Advanced Embedding Techniques

Sparse Lexical and DensE embedding combining sparse/dense approaches

Advanced Embedding Techniques

Techniques to reduce storage requirements

Advanced Retrieval & Learning

Advanced Retrieval & Learning

Lifelong learning systems

Advanced Retrieval & Learning

Multi-language adaptation

Advanced Retrieval & Learning

Adapting to specific domains

Advanced Retrieval & Learning

Learning from minimal examples

Advanced Retrieval & Learning

Learning from new data

Advanced Retrieval & Learning

Tracking information relevance over time

Advanced Retrieval & Learning

Updating stale information

Advanced Retrieval & Learning

Learning to learn quickly

Advanced Retrieval & Learning

Real-time model updates

Advanced Retrieval & Learning

Fine-tuning for RAG

Advanced Retrieval & Learning

Knowledge transfer across domains

Advanced Retrieval & Learning

No task-specific examples

Advanced Retrieval Methods

Advanced Retrieval Methods

Contextualized Late Interaction over BERT

Advanced Retrieval Methods

ColBERT applied to multimodal (vision) content

Advanced Retrieval Methods

Broadening query scope

Advanced Retrieval Methods

Neural dense retrieval approach

Advanced Retrieval Methods

LLM-generated synthetic documents

Advanced Retrieval Methods

Refining retrieval in steps

Advanced Retrieval Methods

Retrieving across multiple documents

Advanced Retrieval Methods

Multiple reformulations of single query

Advanced Retrieval Methods

Enriching query with synonyms and related terms

Advanced Retrieval Methods

Rewriting queries for better matching

Advanced Retrieval Methods

Transforming queries to improve retrieval

Advanced Retrieval Methods

Breaking complex queries into parts

Advanced Retrieval Methods

Adding alternative terms

Architectures & Models

BERT

BERTBERT
Architectures & Models

Bidirectional Encoder Representations from Transformers

BM25

BM25BM25
Architectures & Models

Best Matching 25

DPR

DPRDPR
Architectures & Models

Dense Passage Retrieval

HNSW

HNSWHNSW
Architectures & Models

Hierarchical Navigable Small World

Architectures & Models

Hypothetical Document Embeddings

IVF

IVFIVF
Architectures & Models

Inverted File

PQ

PQPQ
Architectures & Models

Product Quantization

RAFT

RAFTRAFT
Architectures & Models

Retrieval-Augmented Fine-Tuning

SPLADE

SPLADESPLADE
Architectures & Models

Sparse Lexical and DensE

TF-IDF

TF-IDFTF-IDF
Architectures & Models

Term Frequency-Inverse Document Frequency

Benchmarks & Datasets

BEIR

BEIRBEIR
Benchmarks & Datasets

Diverse information retrieval benchmark

Benchmarks & Datasets

Late interaction evaluation

Benchmarks & Datasets

Multi-hop question answering

MS-MARCO

MS-MARCOMS-MARCO
Benchmarks & Datasets

Large-scale IR benchmark

Benchmarks & Datasets

Long-form QA benchmark

Benchmarks & Datasets

Biomedical question answering

Benchmarks & Datasets

Large-scale QA dataset

Benchmarks & Datasets

Cross-lingual QA benchmark

Compliance & Ethics

Compliance & Ethics

Systematic discrimination

Compliance & Ethics

Identifying unfair patterns

CCPA

CCPACCPA
Compliance & Ethics

California Consumer Privacy Act

Compliance & Ethics

Understanding model decisions

Compliance & Ethics

Measuring equality

FINRA

FINRAFINRA
Compliance & Ethics

Financial regulatory compliance

GDPR

GDPRGDPR
Compliance & Ethics

General Data Protection Regulation

HIPAA

HIPAAHIPAA
Compliance & Ethics

Health Insurance Portability and Accountability Act

Compliance & Ethics

Ethical AI principles

SOX

SOXSOX
Compliance & Ethics

Sarbanes-Oxley compliance

Compliance & Ethics

Clear system explanation

Context & Token Management

Context & Token Management

Dividing content for token limits

Context & Token Management

Maintaining important information

Context & Token Management

Cutting text to fit limits

Context & Token Management

Context supporting response claims

Context & Token Management

Tokens in query and context

Context & Token Management

Tokens in generated response

Context & Token Management

Reusing previously computed prompts

Context & Token Management

Fixed-size moving context window

Context & Token Management

Condensing text to save tokens

Context & Token Management

Allocated tokens for retrieval and generation

Context & Token Management

Maximizing value per token

Context & Token Management

Maximum tokens LLM can process

Core RAG

ANN

ANNANN
Core RAG

Approximate Nearest Neighbor

IR

IRIR
Core RAG

Information Retrieval

LLM

LLMLLM
Core RAG

Large Language Model

NLP

NLPNLP
Core RAG

Natural Language Processing

QA

QAQA
Core RAG

Question Answering

RAG

RAGRAG
Core RAG

Retrieval-Augmented Generation

Data & Context Management

Data & Context Management

Storing context externally

Data & Context Management

Previous interactions for context

Data & Context Management

Retrieval from multiple systems

Data & Context Management

Querying across distributed knowledge sources

Data & Context Management

Persistent user preferences and patterns

Data & Context Management

Session and long-term memory integration

Data & Context Management

Dynamic knowledge base updates

Data & Context Management

Curating important memories

Data & Context Management

Short-term conversation state

Data & Context Management

Real-time document and response streaming

Data & Privacy

Data & Privacy

RBAC and ABAC

Data & Privacy

Tracking system actions

Data & Privacy

Recording user permissions

Data & Privacy

Removing identifying information

Data & Privacy

Tracking data origin and transformations

Data & Privacy

Source and history tracking

Data & Privacy

Data storage encryption

Data & Privacy

Network data encryption

Data & Privacy

Identifying personally identifiable information

Data & Privacy

Removing sensitive information

Data & Privacy

Replacing sensitive data

Data Structures

Data Structures

Sorted tree structure

Data Structures

Connected nodes and edges

Data Structures

Key-value storage

Data Structures

Priority queue structure

Data Structures

Semantic network

Data Structures

Graph with attributes

Data Structures

First-in-first-out

Data Structures

Probabilistic balanced structure

Data Structures

Prefix tree for strings

Database Features

Database Features

Transaction guarantees

Database Features

Conditional document selection

Database Features

Redundancy and failover

Database Features

Storing non-vector data

Database Features

Isolated data per tenant

Database Features

Data partition within database

Database Features

Copying data across nodes

Database Features

Handling growing data

Database Features

Distributing data across partitions

Database Features

Access control per tenant

Document & Data Management

Document & Data Management

Redundant content between consecutive chunks

Document & Data Management

Number of tokens or characters per segment

Document & Data Management

Breaking documents into manageable pieces for embedding

Document & Data Management

Complete collection of documents in knowledge base

Document & Data Management

Breaking text into logical units

Document & Data Management

Repository for original/source documents

Document & Data Management

Tracking changes and updates to source material

Document & Data Management

Character or token-based uniform segmentation

Document & Data Management

Adding contextual information (tags, dates, source)

Document & Data Management

Content-aware splitting based on meaning

Document & Data Management

/ **Agentic Chunking** - LLM-assisted intelligent document splitting

Document & Data Management

Pulling readable content from various formats

Document & Data Management

Dividing content while preserving context

Document Processing

Document Processing

Web scraping

Document Processing

Document parsing and chunking

Document Processing

Vision-language model(VLM) OCR

Document Processing

Open-source vision-language model(VLM) OCR

Document Processing

Vision-language model(VLM) OCR

Document Processing

Open-source vision-language model(VLM) OCR

Document Processing

PDF content extraction

Document Processing

PDF text extraction

Document Processing

Browser automation

Document Processing

OCR text recognition

Document Processing

Document parsing and chunking

Domain Applications

Domain Applications

API and documentation retrieval

Domain Applications

FAQ and ticket automation

Domain Applications

Infrastructure documentation

Domain Applications

Organizational information system

Domain Applications

Market data and compliance

Domain Applications

Fact-checking and verification

Domain Applications

Contract and precedent analysis

Domain Applications

/ **Clinical RAG** - Patient records and guidelines

Domain Applications

Property and regulation information

Domain Applications

Academic literature integration

Embedding Fundamentals

Embedding Fundamentals

Angle-based similarity metric between vectors

Embedding Fundamentals

Similarity calculation for normalized vectors

Embedding Fundamentals

Neural network converting text to vectors

Embedding Fundamentals

Multi-dimensional space where vectors are positioned

Embedding Fundamentals

Numerical vector representations of text capturing semantic meaning

Embedding Fundamentals

Straight-line distance between vectors

Embedding Fundamentals

Contextual understanding of text beyond keywords

Embedding Fundamentals

Finding documents by meaning rather than keywords

Embedding Fundamentals

Scaling vectors to unit length

Embedding Fundamentals

Numeric array encoding semantic content

Embedding Models

Open-source high-performance embeddings

CLIP

CLIPCLIP
Embedding Models

Vision-language embedding model

Embedding Models

Commercial multilingual embedding model

Embedding Models

Large-scale training datasets embeddings

Embedding Models

Embeddings from Meta's LLaMA models

Embedding Models

Open-source efficient embeddings

3,072-dimensional embedding model

1,536-dimensional embedding model

Embedding Models

Framework for semantic textual similarity

Embedding Models

Embeddings from Voyage AI models

Error Analysis

Error Analysis

Incorrect source citation

Error Analysis

New documents not retrievable

Error Analysis

Confident false information

Error Analysis

Retrieved docs not supporting query

Error Analysis

Fabricated information not in context

Error Analysis

Contradictory information

Error Analysis

Adversarial attack via input

Error Analysis

Relevant docs ranked too low

Error Analysis

Missing relevant documents

Error Analysis

Meaning divergence between query and context

Evaluation Tools & Frameworks

Evaluation Tools & Frameworks

Agent behavior tracking

Evaluation Tools & Frameworks

AI observability platform

Evaluation Tools & Frameworks

LLM evaluation with code

Evaluation Tools & Frameworks

Production monitoring and evaluation

Evaluation Tools & Frameworks

LangChain evaluation platform

Evaluation Tools & Frameworks

Prompt testing and comparison

Evaluation Tools & Frameworks

Data validation for outputs

RAGAS

RAGASRAGAS
Evaluation Tools & Frameworks

RAG evaluation framework

Foundational Terms

Foundational Terms

Query combined with retrieved context before generation

Foundational Terms

Maximum amount of text an LLM can process (tokens)

Foundational Terms

Information sources outside the LLM's training data

Foundational Terms

LLM component that synthesizes responses from retrieved context

Foundational Terms

Anchoring generated responses in retrieved facts to reduce hallucinations

Foundational Terms

Process of preparing and storing documents for retrieval

Foundational Terms

Structured or unstructured collection of documents and data

Foundational Terms

System component responsible for fetching relevant documents

Technique combining information retrieval with generative AI for grounded responses

Generation & Response Metrics

Generation & Response Metrics

How well response addresses question

Generation & Response Metrics

Semantic similarity to expected answer

Generation & Response Metrics

N-gram overlap with reference

Generation & Response Metrics

Correctness of source attribution

Generation & Response Metrics

Logical flow and readability

Generation & Response Metrics

Addressing all query aspects

Generation & Response Metrics

Predictions matching reference exactly

Generation & Response Metrics

Accuracy of claims in response

Generation & Response Metrics

Response grounded only in retrieved context

Generation & Response Metrics

Recall-oriented understudy evaluation

Generation & Response Metrics

Meaning-based comparison

Infrastructure & Deployment

Infrastructure & Deployment

Request routing and management

Infrastructure & Deployment

Content delivery network

Infrastructure & Deployment

AWS, Azure, GCP

Infrastructure & Deployment

Reusing database connections

Infrastructure & Deployment

Docker containers

Infrastructure & Deployment

Multi-machine setup

Infrastructure & Deployment

Placing systems closer to users

Infrastructure & Deployment

Adding more servers

Infrastructure & Deployment

Container orchestration

Infrastructure & Deployment

Distributing requests

Infrastructure & Deployment

Independent service components

Infrastructure & Deployment

On-premise deployment

Infrastructure & Deployment

Using more powerful servers

Intelligent RAG Patterns

Intelligent RAG Patterns

Iterative query refinement

Intelligent RAG Patterns

Dynamic strategy selection based on query type

Intelligent RAG Patterns

Autonomous agent-driven retrieval decisions

Intelligent RAG Patterns

Multiple retrieval paths in single query

Intelligent RAG Patterns

Post-generation error checking

Intelligent RAG Patterns

Retaining interaction history

Intelligent RAG Patterns

Rapid adaptation with few examples

Intelligent RAG Patterns

Coordination between specialized agents

Intelligent RAG Patterns

Self-reflective improvement mechanisms

Intelligent RAG Patterns

Internal critique and iteration

Intelligent RAG Patterns

Self-assessment of relevance

Intelligent RAG Patterns

Model self-evaluates and critiques own outputs

Intelligent RAG Patterns

RAG using external tools/APIs

Long-Context Handling

Long-Context Handling

Extended text handling

Long-Context Handling

100K+ token support

Long-Context Handling

Token allocation strategy

Long-Context Handling

Updating high-value information

Long-Context Handling

Preferring recent documents

Long-Context Handling

Importance weighting

Long-Context Handling

Choosing what to retain

Long-Context Handling

Time-aware retrieval

Machine Learning Concepts

Machine Learning Concepts

Focusing on relevant parts

BERT

BERTBERT
Machine Learning Concepts

Bidirectional Encoder Representations

Machine Learning Concepts

Two-part model architecture

Machine Learning Concepts

Adapting pre-trained models

GPT

GPTGPT
Machine Learning Concepts

Generative Pre-trained Transformer

Machine Learning Concepts

Deep learning models

Machine Learning Concepts

Learning useful features

Machine Learning Concepts

Elements attending to each other

Machine Learning Concepts

Using knowledge from one task for another

Machine Learning Concepts

Attention-based models

Metrics

AUC

AUCAUC
Metrics

Area Under Curve

BLEU

BLEUBLEU
Metrics

BiLingual Evaluation Understudy

EM

EMEM
Metrics

Exact Match

F1

F1F1
Metrics

F1 Score

MAP

MAPMAP
Metrics

Mean Average Precision

MRR

MRRMRR
Metrics

Mean Reciprocal Rank

NDCG

NDCGNDCG
Metrics

Normalized Discounted Cumulative Gain

ROC

ROCROC
Metrics

Receiver Operating Characteristic

ROUGE

ROUGEROUGE
Metrics

Recall-Oriented Understudy for GIST Evaluation

Monitoring & Observability

Monitoring & Observability

Infrastructure monitoring

Monitoring & Observability

LLM error analysis

Monitoring & Observability

Metrics visualization

Monitoring & Observability

Token usage and cost tracking

Monitoring & Observability

Application performance

Monitoring & Observability

Observability and evaluation

Monitoring & Observability

Metrics collection

Multimodal RAG

Multimodal RAG

Vector representation of audio

Multimodal RAG

Speech-to-text

Multimodal RAG

Text-to-image, image-to-text

Multimodal RAG

Scanned PDFs and photos

Multimodal RAG

Vector representations of images

Multimodal RAG

Finding similar images

Multimodal RAG

Handling text, images, video, audio

Text from images

Single space for multiple modalities

Multimodal RAG

Extracting frames and audio

Multimodal RAG

Model for image embeddings

QA on images

Optimization Techniques

Optimization Techniques

Grouping queries for efficiency

Optimization Techniques

Storing frequent results

Optimization Techniques

Minimizing expenses

Optimization Techniques

Terminating search early

Optimization Techniques

Improving retrieval speed

Optimization Techniques

Speeding up responses

Optimization Techniques

Distributed caching

Optimization Techniques

Using cheaper models first

Optimization Techniques

Reusing computed contexts

Optimization Techniques

Optimizing query execution

Optimization Techniques

In-memory cache layer

Optimization Techniques

More queries per second

Optimization Techniques

Reducing token usage

Orchestration & Framework Libraries

Orchestration & Framework Libraries

Multi-agent conversation framework

Orchestration & Framework Libraries

Agent team orchestration

Orchestration & Framework Libraries

Declarative LLM programming

Orchestration & Framework Libraries

End-to-end RAG framework

Orchestration & Framework Libraries

Comprehensive LLM orchestration framework

Orchestration & Framework Libraries

Stateful workflow graphs

Orchestration & Framework Libraries

Document indexing and retrieval framework

Orchestration & Framework Libraries

Lightweight agent coordination

Orchestration & Framework Libraries

.NET LLM integration

Performance Concepts

Performance Concepts

Data transfer rate

Performance Concepts

Complexity classification

Performance Concepts

Performance limiting factor

Performance Concepts

Model prediction time

Performance Concepts

Response time

Performance Concepts

Storage requirement

Performance Concepts

Performance measurement

Performance Concepts

Algorithm memory usage

Performance Concepts

Operations per unit time

Performance Concepts

Algorithm speed analysis

Personalization & Memory

Personalization & Memory

Monitoring user interactions

Personalization & Memory

Segmenting by user attributes

Personalization & Memory

Recent interaction memory

Personalization & Memory

Extreme user customization

Personalization & Memory

Organizing remembered information

Personalization & Memory

System for custom ranking

Personalization & Memory

Long-term knowledge storage

Personalization & Memory

Current conversation context

Personalization & Memory

Learning user preferences

Personalization & Memory

Custom vector representations

Platforms & Tools

AWS

AWSAWS
Platforms & Tools

Amazon Web Services

Platforms & Tools

Microsoft Azure

CLI

CLICLI
Platforms & Tools

Command Line Interface

FOSS

FOSSFOSS
Platforms & Tools

Free and Open-Source Software

GCP

GCPGCP
Platforms & Tools

Google Cloud Platform

GUI

GUIGUI
Platforms & Tools

Graphical User Interface

JSON

JSONJSON
Platforms & Tools

JavaScript Object Notation

REST

RESTREST
Platforms & Tools

Representational State Transfer

SDK

SDKSDK
Platforms & Tools

Software Development Kit

Prompting Techniques

Prompting Techniques

Comparing prompt variants

Synonyms:B Testing
Prompting Techniques

Step-by-step reasoning prompts

Prompting Techniques

Including retrieved documents in prompt

Prompting Techniques

Runtime prompt modification

Prompting Techniques

Including examples in prompt

Prompting Techniques

Crafting clear task instructions

Prompting Techniques

Enhancing prompt with context

Prompting Techniques

Testing prompt effectiveness

Prompting Techniques

Improving prompt quality

Prompting Techniques

Reusable prompt structure

Prompting Techniques

Managing prompt variations

Prompting Techniques

Base instructions for LLM behavior

Prompting Techniques

Query or request from user

RAG Variants & Techniques

RAG Variants & Techniques

RAG with enhanced retrieval techniques

RAG Variants & Techniques

Fundamental RAG pattern

RAG Variants & Techniques

Separate, independently upgradeable components

RAG Variants & Techniques

Integrated single system

RAG Variants & Techniques

Basic retrieve-then-generate pipeline

RAG Variants & Techniques

Classical single-stage retrieval approach

Ranking & Re-ranking

Ranking & Re-ranking

Model that scores query-document pairs jointly

Ranking & Re-ranking

Using BERT-like models to score pairs

Ranking & Re-ranking

Separate encoders for query and document

Ranking & Re-ranking

ML models for optimal ranking

NDCG@K

NDCG@KNDCG@K
Ranking & Re-ranking

Normalized Discounted Cumulative Gain ranking metric

Ranking & Re-ranking

Fraction of top-k results that are relevant

Ranking & Re-ranking

Algorithms determining result order

Ranking & Re-ranking

Reordering retrieved results by relevance

Ranking & Re-ranking

Fraction of all relevant docs in top-k

Ranking & Re-ranking

Assigning confidence to document relevance

Ranking & Re-ranking

Minimum score for document inclusion

Ranking & Re-ranking

Returning k highest-ranked documents

Ranking Algorithms

BM25

BM25BM25
Ranking Algorithms

Probabilistic ranking function (builds on TF-IDF)

Ranking Algorithms

k1 (term saturation), b (length norm)

How rare term is

Ranking Algorithms

Adjusting for document length

Ranking Algorithms

Standard BM25 implementation

Ranking Algorithms

Probability-based scoring

Ranking Algorithms

Ordering by relevance

Ranking Algorithms

How often term appears

Ranking Algorithms

Preventing TF dominance

TF-IDF

TF-IDFTF-IDF
Ranking Algorithms

Term Frequency-Inverse Document Frequency

Retrieval Metrics

Retrieval Metrics

Are top results ranked in order of relevance?

Retrieval Metrics

Does context contain info needed for answer?

Retrieval Metrics

Harmonic mean of precision and recall

F1@K

F1@KF1@K
Retrieval Metrics

F1 score at top-k results

Average precision across queries

Average rank of first relevant result

MRR@K

MRR@KMRR@K
Retrieval Metrics

MRR considering only top-k results

NDCG@K

NDCG@KNDCG@K
Retrieval Metrics

NDCG@top-k results

Ranking quality metric

Retrieval Metrics

Fraction of retrieved results that are relevant

Retrieval Metrics

Fraction of all relevant documents retrieved

Retrieval Metrics

Numerical measure of document relevance

Retrieval Metrics

Embedding-based relevance measure

Search Techniques

Search Techniques

Searching across all text fields

Search Techniques

Retrieving at different granularity levels

Search Techniques

Combining keyword and semantic search

Search Techniques

Traditional text matching using terms

Search Techniques

ColBERT-style token-level interactions

Search Techniques

Sequential retrieval steps with refinement

Search Techniques

Search using deep learning models

Search Techniques

Scoring document relevance to query

Combining ranking lists from multiple retrievers

Search Techniques

Meaning-based search (vs. keyword)

Search Techniques

Combining sparse (BM25) and dense (embeddings) methods

Security Threats

Security Threats

Crafted attack inputs

Security Threats

Corrupting training/knowledge bases

Security Threats

Denial-of-service attacks

Security Threats

Circumventing safety guardrails

Security Threats

Stealing model knowledge

Security Threats

Malicious input attacks

Security Threats

Controlling request volume

Selection & Filtering

Selection & Filtering

Filtering by field values

Selection & Filtering

Initial broad retrieval stage

Selection & Filtering

Removing duplicate or near-duplicate results

Selection & Filtering

Connecting mentions to knowledge base entities

Selection & Filtering

Multi-dimensional filtering

Selection & Filtering

Selecting documents by attributes

Selection & Filtering

Identifying entities in text

Selection & Filtering

Modifying queries for better results

Selection & Filtering

Combining results from multiple sources

Specialized Retrieval Approaches

Specialized Retrieval Approaches

Understanding cause-effect relationships

Specialized Retrieval Approaches

Identifying clusters in knowledge graphs

Specialized Retrieval Approaches

Synthesizing info across multiple sources

Specialized Retrieval Approaches

Graph of connected entities

Specialized Retrieval Approaches

/ **Knowledge-Graph-Aware Retrieval** - Using entity relationships

Specialized Retrieval Approaches

Incorporating structured knowledge

Specialized Retrieval Approaches

Multi-step logical inference

Specialized Retrieval Approaches

Following relationships for context

Specialized Retrieval Approaches

Retrieving from tables, databases, knowledge graphs

Storage Technologies

Storage Technologies

Index for dense vectors

Storage Technologies

Vectors across servers

Storage Technologies

Network structure indexing

Storage Technologies

Combined sparse and dense

Storage Technologies

Vectors stored in RAM

Storage Technologies

Mapping terms to documents

Storage Technologies

Vectors on disk

Storage Technologies

Index for sparse vectors

Storage Technologies

Hierarchical indexing

Storage Technologies

Optimized structure for vector storage

System-Level Metrics

System-Level Metrics

Uptime and reliability

System-Level Metrics

Token usage and infrastructure costs

System-Level Metrics

Time to generate response

System-Level Metrics

Storage and RAM requirements

System-Level Metrics

Time from query to results

System-Level Metrics

Performance as system grows

System-Level Metrics

Queries processed per unit time

System-Level Metrics

Number of tokens consumed

Techniques & Patterns

API

APIAPI
Techniques & Patterns

Application Programming Interface

CRAG

CRAGCRAG
Techniques & Patterns

Corrective RAG

ETL

ETLETL
Techniques & Patterns

Extract Transform Load

MCP

MCPMCP
Techniques & Patterns

Model Context Protocol

NER

NERNER
Techniques & Patterns

Named Entity Recognition

OCR

OCROCR
Techniques & Patterns

Optical Character Recognition

RRF

RRFRRF
Techniques & Patterns

Reciprocal Rank Fusion

Techniques & Patterns

Self-Reflective RAG

VQA

VQAVQA
Techniques & Patterns

Visual Question Answering

Text Processing

Text Processing

Standardizing letter case

Text Processing

Text standardization

Text Processing

Identifying text language

Text Processing

Converting to base form

Text Processing

Reducing words to root form

Text Processing

Removing common words

Text Processing

Breaking text into tokens

Text Processing

Cleaning spacing

Use Case Specific

Use Case Specific

Conversational interface

Use Case Specific

Creating source attributions

Use Case Specific

Finding relevant documents

Use Case Specific

Verifying claims

Use Case Specific

Question answering

Use Case Specific

Suggesting content

Use Case Specific

Meaning-based search

Use Case Specific

Condensing document content

Vector Database Platforms

Vector Database Platforms

Lightweight open-source embedded database

Vector Database Platforms

Full-text search with vector support

FAISS

FAISSFAISS
Vector Database Platforms

Facebook's high-performance similarity search library

Vector Database Platforms

Modern vector database with multi-modal support

Vector Database Platforms

Enterprise open-source vector database for massive scale

Vector Database Platforms

Vector capabilities in MongoDB

Vector Database Platforms

Managed vector database with hybrid search

Vector Database Platforms

Vector extension for PostgreSQL

Vector Database Platforms

Rust-based high-performance vector database

Vector Database Platforms

PostgreSQL with vector support

Vector Database Platforms

Specialized database optimized for storing and querying embeddings

Vector Database Platforms

Alternative term for vector database

Vector Database Platforms

Open-source vector database with GraphQL API

Vector Indexing & Search Algorithms

Vector Indexing & Search Algorithms

Fast inexact similarity search

Vector Indexing & Search Algorithms

Search over dense embeddings

HNSW

HNSWHNSW
Vector Indexing & Search Algorithms

Hierarchical Navigable Small World graph indexing

Vector Indexing & Search Algorithms

Data structure for efficient keyword retrieval

IVF

IVFIVF
Vector Indexing & Search Algorithms

Inverted File indexing for clustering

Vector Indexing & Search Algorithms

Finding k most similar items

Vector Indexing & Search Algorithms

Locating closest points in vector space

Vector Indexing & Search Algorithms

Finding items similar to query

Vector Indexing & Search Algorithms

Similarity-based retrieval using embeddings

Vector Indexing & Search Algorithms

Weak-AND for efficient pruning

Vector Similarity Metrics

Vector Similarity Metrics

Angle-based metric

Vector Similarity Metrics

Angle between vectors

Vector Similarity Metrics

Inner product of vectors

Vector Similarity Metrics

Straight-line distance

Vector Similarity Metrics

Bit-level differences

Vector Similarity Metrics

Set overlap measure

Vector Similarity Metrics

Euclidean distance norm

Vector Similarity Metrics

Grid-based distance

Workflow & Automation Platforms

Workflow & Automation Platforms

Data pipeline orchestration

Workflow & Automation Platforms

Rapid RAG development

Workflow & Automation Platforms

SaaS integration platform

Workflow & Automation Platforms

Visual workflow automation

Workflow & Automation Platforms

Workflow orchestration

Workflow & Automation Platforms

Low-code automation