Definition
Vector representations where semantically equivalent text segments from different languages are mapped to proximate coordinates in a shared latent space. In RAG pipelines, this enables cross-lingual retrieval, allowing a query in one language to find relevant context in a source document of another language without explicit machine translation.
It is semantic alignment in vector space, not a word-for-word translation process.
"A universal map where the word 'Water' and 'Agua' both point to the exact same GPS coordinates on a semantic globe."
Conceptual Overview
Vector representations where semantically equivalent text segments from different languages are mapped to proximate coordinates in a shared latent space. In RAG pipelines, this enables cross-lingual retrieval, allowing a query in one language to find relevant context in a source document of another language without explicit machine translation.
Disambiguation
It is semantic alignment in vector space, not a word-for-word translation process.
Visual Analog
A universal map where the word 'Water' and 'Agua' both point to the exact same GPS coordinates on a semantic globe.