Definition
The horizontal partitioning of high-dimensional vector embeddings across multiple distributed nodes to enable parallelized similarity searches; it facilitates massive-scale RAG by overcoming single-node memory limits, though it introduces architectural complexity in cross-shard result merging.
Distributing vector indices for parallel search, not just splitting relational database rows.
"Dividing a massive library into separate wings so that multiple researchers can search for different topics simultaneously without bottlenecking at a single entrance."
Conceptual Overview
The horizontal partitioning of high-dimensional vector embeddings across multiple distributed nodes to enable parallelized similarity searches; it facilitates massive-scale RAG by overcoming single-node memory limits, though it introduces architectural complexity in cross-shard result merging.
Disambiguation
Distributing vector indices for parallel search, not just splitting relational database rows.
Visual Analog
Dividing a massive library into separate wings so that multiple researchers can search for different topics simultaneously without bottlenecking at a single entrance.