The Hallucination Probe

the aha moment

Ask a small model 50 factual questions split into common-knowledge and long-tail buckets, score the answers with multi-alias matching, and plot the hallucination rate against the Kalai-Vempala theoretical lower bound. Measure ~5% on common questions, ~40-60% on long-tail — exactly where the proof says you should land.

Open in Colab View on GitHub

the facts

Time: 60–90 min
Hardware: CPU · Colab
Act: V · Where They Break
Status: Live
Artifact: A scored question bank + a measured-vs-theoretical hallucination-rate chart.

run it locally

Clone the labs repo and run this lab as a script or open it as a notebook:

git clone https://github.com/iqbal-sk/Microscale-labs.git
cd Microscale
just setup-auto      # auto-detects CPU / CUDA / Mac
just run 06
# or:  jupyter lab labs/06-hallucination-probe/lab.py

Full install options (uv, pip, or the platform-specific CUDA paths) are in the labs README.

read alongside

Lesson · 9 min · 45 xp

Hallucination is inevitable

The Kalai-Xu proof that any model trained on finite data must hallucinate — visualized interactively with epistemic uncertainty and benchmark bias

Open in Colab View on GitHub ← all labs