Controlled Generation
Techniques to ensure generated content remains faithful to retrieved information:
Constrained Decoding:
- Entity Grounding: Ensuring mentioned entities appear in retrieved context
- Citation Alignment: Generating inline citations linked to specific sources
- Factual Anchoring: Requiring statements to be traceable to retrieved content
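As a rough illustration of entity grounding and citation alignment, the sketch below checks an answer after it has been generated, rather than constraining the decoder token by token. It assumes citations are written as bracketed numbers such as [1], and it uses runs of capitalized words as a stand-in for real named-entity recognition. The function names `ungrounded_entities` and `invalid_citations` are hypothetical.

```python
import re


def ungrounded_entities(answer: str, passages: list[str]) -> list[str]:
    """Return capitalized terms in the answer that occur in no retrieved passage."""
    context = " ".join(passages).lower()
    # Naive entity heuristic (assumption): runs of capitalized words.
    # A production system would use a proper NER model instead.
    candidates = re.findall(r"\b[A-Z][a-z]+(?:\s+[A-Z][a-z]+)*\b", answer)
    return [c for c in candidates if c.lower() not in context]


def invalid_citations(answer: str, num_sources: int) -> list[int]:
    """Return cited source numbers, written like [3], outside 1..num_sources."""
    cited = [int(n) for n in re.findall(r"\[(\d+)\]", answer)]
    return [n for n in cited if not 1 <= n <= num_sources]


passages = ["Marie Curie won the Nobel Prize in Physics in 1903."]
answer = "Marie Curie won the Nobel Prize in 1903 [1], and Albert Einstein praised her [2]."

print(ungrounded_entities(answer, passages))  # entities missing from the context
print(invalid_citations(answer, len(passages)))  # citations with no matching source
```

An answer that fails either check could be regenerated or have the offending sentence removed. The heuristic will also flag ordinary capitalized sentence openers, which is one reason a real entity recognizer is preferable.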
Factual Consistency Checks:
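- NLI-Based Verification: Using a natural language inference (NLI) model to check that each generated claim is entailed by the retrieved passages
- Self-Consistency: Sampling multiple responses to the same query and keeping the answer on which most of them agree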
- Uncertainty Expression: Encouraging models to express uncertainty when information is ambiguous
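These checks can be sketched in a few lines, under some stated assumptions. The NLI model is represented by a caller-supplied `entails(premise, hypothesis)` function, since no specific model is named here. Self-consistency is reduced to a majority vote over short answers. The `min_agreement` threshold and the fallback wording are illustrative choices, not recommended values.

```python
from collections import Counter
from typing import Callable

ABSTAIN = "I'm not sure."


def unsupported_claims(
    claims: list[str],
    passages: list[str],
    entails: Callable[[str, str], bool],
) -> list[str]:
    """Return claims that no retrieved passage entails.

    `entails` is a placeholder for an NLI model call (assumption).
    """
    return [c for c in claims if not any(entails(p, c) for p in passages)]


def consensus_answer(samples: list[str], min_agreement: float = 0.6) -> str:
    """Majority vote over sampled answers; abstain when agreement is too low."""
    if not samples:
        return ABSTAIN
    # Normalize lightly so trivial formatting differences do not split the vote.
    normalized = [s.strip().lower().rstrip(".") for s in samples]
    answer, votes = Counter(normalized).most_common(1)[0]
    if votes / len(samples) < min_agreement:
        return ABSTAIN  # express uncertainty instead of guessing
    return answer
```

Returning an explicit abstention when the samples disagree is one simple way to combine self-consistency with uncertainty expression. Exact-match voting only suits short answers; longer responses would need a semantic comparison.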
Two-Stage Generation:
- First extracting relevant facts from retrieved documents
- Then synthesizing these facts into coherent responses
- Keeping information extraction separate from text generation, so the extracted facts can be checked before they are used
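The two stages might be wired together as in the sketch below. It assumes a caller-supplied `generate(prompt)` function that wraps whatever LLM is in use. The prompt wording, the function names, and the fallback message are all hypothetical.

```python
from typing import Callable

NO_ANSWER = "The retrieved documents do not contain an answer."


def extract_facts(
    question: str, documents: list[str], generate: Callable[[str], str]
) -> list[str]:
    """Stage 1: ask the model to list only facts stated in the sources."""
    sources = "\n".join(f"[{i}] {d}" for i, d in enumerate(documents, start=1))
    prompt = (
        "List the facts from the sources that help answer the question, "
        "one per line, each ending with its source number in brackets.\n"
        f"Question: {question}\nSources:\n{sources}"
    )
    return [line.strip() for line in generate(prompt).splitlines() if line.strip()]


def synthesize(
    question: str, facts: list[str], generate: Callable[[str], str]
) -> str:
    """Stage 2: the model sees only the extracted facts, not the raw documents."""
    prompt = (
        "Answer the question using only these facts and keep their citations.\n"
        f"Question: {question}\nFacts:\n" + "\n".join(f"- {f}" for f in facts)
    )
    return generate(prompt)


def two_stage_answer(
    question: str, documents: list[str], generate: Callable[[str], str]
) -> str:
    facts = extract_facts(question, documents, generate)
    if not facts:
        return NO_ANSWER  # nothing extracted, so there is nothing to synthesize
    return synthesize(question, facts, generate)
```

Because the fact list is an explicit intermediate value, it can be logged, verified against the sources, or filtered before the second call.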
These controls balance the creative capabilities of LLMs with the factual constraints of retrieved information, reducing hallucination while maintaining fluent, natural responses.