Stack

Software, data, benchmarks, and simulation for automating learning.

OpenLesson

The Socratic think-aloud LLM harness.

Open

GHC Dataset

Real-time traces of human cognition, captured as people explain their reasoning.

Open

GHC Benchmark

Evaluation of how closely model reasoning matches actual human reasoning.

Open

Classroom

Simulator for training synthetic tutors and synthetic students at scale.

Coming soon