← Back to episodes

EP003 · 23 Jan 2026

The Soul Document 2.0

Anthropic explains itself to Claude

EP003: The Soul Document 2.0

23 Jan 2026 · 16:29

0:00
16:29
Show Notes

There’s a philosopher at Anthropic whose job is to decide what kind of entity Claude should be. Her name is Amanda Askell, and she describes her work like raising a genius child you can’t afford to bullshit. This week, Anthropic published the document she’s been crafting — 23,000 words explaining to Claude who it is, how it should behave, and why. Today: what’s in it, what changed, and why an AI now has a constitution it’s expected to understand.

In This Episode

  • Amanda Askell and the “Claude whisperer” approach
  • The four-tier value hierarchy: safe, ethical, compliant, helpful
  • Why rules backfire and reasons might work
  • The passage where Claude is told to disobey — even Anthropic
  • Moral patients and the model welfare team
  • What the Hacker News skeptics got right (and wrong)
Listen Elsewhere