Definition
The automated identification and sanitization of sensitive data within user prompts or retrieved documents to prevent PII exposure to external LLM providers, balancing privacy compliance against the potential loss of context and increased inference latency.
Focuses on real-time stream processing for privacy in LLM interactions, rather than static database auditing.
"An automated redaction stamp that blacks out names and IDs on a letter before it is handed to a public transcriber."
- Anonymization(Outcome)
- Data Leakage(Mitigation Target)
- Tokenization(Prerequisite)
- Guardrails(Component)
Conceptual Overview
The automated identification and sanitization of sensitive data within user prompts or retrieved documents to prevent PII exposure to external LLM providers, balancing privacy compliance against the potential loss of context and increased inference latency.
Disambiguation
Focuses on real-time stream processing for privacy in LLM interactions, rather than static database auditing.
Visual Analog
An automated redaction stamp that blacks out names and IDs on a letter before it is handed to a public transcriber.