Evaluation & Safety

Hallucination

When an AI model generates information that sounds plausible but is factually incorrect, fabricated, or unsupported by its training data or provided context.

Why it matters

Hallucination is the primary barrier to deploying AI in high-stakes domains like healthcare, legal, and finance. Understanding and mitigating it is essential for production AI.

Why models hallucinate

LLMs are fundamentally next-token predictors. They generate text that is statistically likely, not necessarily true. When the model lacks sufficient knowledge or context, it fills gaps with plausible-sounding fabrications rather than admitting uncertainty.

Types of hallucination

Factual fabrication — inventing facts, citations, or statistics that don't exist.
Context contradiction — generating output that contradicts the provided context or documents.
Instruction drift — gradually departing from the user's instructions over long outputs.

Mitigation strategies

RAG — ground responses in retrieved documents.
Citations — require the model to cite specific sources for claims.
Confidence calibration — train models to express uncertainty.
Guardrails — post-generation fact-checking and consistency validation.

From our blog

AI Strategy8 min

The Real Reason AI Invents Facts (And How to Make It Stop)

Dec 25, 2025

AI Strategy8 min

The AI Verification Triage: What to Always Check, What to Spot-Check, and What to Trust

Feb 13, 2026