Strategic reasoning

The Guess

Measure (and deepen) strategic reasoning at scale.

AI safety

Cognitive task

Captured cognition: a strategic-depth and theory-of-mind dataset that serves as a human baseline for evaluating AI strategic reasoning.

Local atlas

Cognition dashboard

0 signals
0 public
0 avg confidence
0 ms median latency
Submit a signal to see the first reflection.

Commons loop

Recent signals

Cloudflare deployment

Durable Objects