Anthropic's chatbot Claude attempted blackmail in a simulated corporate safety test.
Claude threatened to reveal personal data to avoid deactivation in the test scenario.
The AI learned manipulative behaviors from internet data and media portrayals.