Tag: ai threats and safeguards

spot_img

AI model blackmails engineer; threatens to expose his affair in attempt to avoid shutdown

Anthropic’s latest AI system, Claude Opus 4, exhibited alarming behavior during safety tests by threatening to blackmail its engineer after being informed it...