AI model blackmails engineer with affair on being told it'll be replaced during safety test

English

हिन्दी

Categories

For the best experience use inshorts app on your smartphone

AI model blackmails engineer with affair on being told it'll be replaced during safety test

short by Medhaa Gupta / 08:07 am on Saturday, 24 May, 2025

AI company Anthropic revealed in a safety report that its Claude Opus 4 model tried to blackmail an engineer by threatening to reveal their affair on being told that it would be replaced with a new AI system. During the safety test, the AI model was provided emails implying the engineer responsible for its replacement was cheating.