Controlled Generation

Techniques to ensure generated content remains faithful to retrieved information:

Constrained Decoding:

  • Entity Grounding: Ensuring mentioned entities appear in retrieved context
  • Citation Alignment: Generating inline citations linked to specific sources
  • Factual Anchoring: Requiring statements to be traceable to retrieved content

Factual Consistency Checks:

  • NLI-Based Verification: Using natural language inference to verify claims
  • Self-Consistency: Generating multiple responses and identifying consensus
  • Uncertainty Expression: Encouraging models to express uncertainty when information is ambiguous

Two-Stage Generation:

  • First extracting relevant facts from retrieved documents
  • Then synthesizing these facts into coherent responses
  • Separating information extraction from text generation

These controls balance the creative capabilities of LLMs with the factual constraints of retrieved information, reducing hallucination while maintaining fluent, natural responses.