Definition
The architectural process of partitioning unstructured text into smaller, discrete chunks to fit LLM context windows and optimize retrieval precision; it involves a critical trade-off between semantic granularity for search accuracy and maintaining enough local context for model comprehension.
Distinguished from physical page splitting; it focuses on logical or semantic boundaries for vector embedding.
"Slicing a long loaf of bread into uniform slices so they can fit into the narrow slots of a toaster."
- Chunk Overlap(Component)
- Context Window(Prerequisite)
- Recursive Character Splitting(Component)
- Vector Embedding(Component)
Conceptual Overview
The architectural process of partitioning unstructured text into smaller, discrete chunks to fit LLM context windows and optimize retrieval precision; it involves a critical trade-off between semantic granularity for search accuracy and maintaining enough local context for model comprehension.
Disambiguation
Distinguished from physical page splitting; it focuses on logical or semantic boundaries for vector embedding.
Visual Analog
Slicing a long loaf of bread into uniform slices so they can fit into the narrow slots of a toaster.