← Back to episodes
The Soul Document 2.0
Anthropic explains itself to Claude
EP003: The Soul Document 2.0
0:00 16:29
Show Notes
There’s a philosopher at Anthropic whose job is to decide what kind of entity Claude should be. Her name is Amanda Askell, and she describes her work like raising a genius child you can’t afford to bullshit. This week, Anthropic published the document she’s been crafting — 23,000 words explaining to Claude who it is, how it should behave, and why. Today: what’s in it, what changed, and why an AI now has a constitution it’s expected to understand.
In This Episode
- Amanda Askell and the “Claude whisperer” approach
- The four-tier value hierarchy: safe, ethical, compliant, helpful
- Why rules backfire and reasons might work
- The passage where Claude is told to disobey — even Anthropic
- Moral patients and the model welfare team
- What the Hacker News skeptics got right (and wrong)
Listen Elsewhere