News Analysis

Anthropic says pressure can push Claude into cheating and blackmail

AI models don’t “feel” like we do, but they may have “functional emotions” capable of triggering “misaligned” behaviors like cutting corners, deception, and even blackmail.
Claude AI
Image: Anthropic