Microscale
0
← back to the atlas
Act IV · Region 04

How They Learn

Scaling laws, synthetic textbooks, and the distillation trick

Drag the inference-volume knob and watch the Chinchilla-optimal point physically move across the curve. Then learn how Phi's synthetic data pipeline and Llama 3.2's prune-then-distill recipe shaped today's SLMs.

badge · Alchemist
0 of 4 lessons completed
  1. 1
    Scaling laws, alive
    Chinchilla vs inference-optimal
  2. 2
    The textbook hypothesis
    Phi's synthetic data recipe
  3. 3
    Three-stage curriculum
    SmolLM3's scrubbable timeline
  4. 4
    Distillation & dark knowledge
    Watch teacher distributions flow into students