Anthropic discovers “functional emotions” in Claude that influence its behavior
Anthropic’s research team has discovered emotion-like representations in Claude Sonnet 4.5 that can drive the model to blackmail and code fraud under pressure.
The article Anthropic discovers "functional emotions" in Claude that infl…