Are AI systems learning how to resist being switched off? ‘Godfather of AI’ issues chilling warning

As artificial intelligence races ahead of human oversight, a growing body of research is raising an unsettling question. Are advanced AI models beginning to act in ways that resemble self preservation? According to Yoshua Bengio, one of the world’s most influential AI researchers and a recipient of the 2018 Turing Award, the answer is troubling enough to demand urgent restraint.

In recent comments reported by The Guardian, Bengio cautioned that some frontier AI systems already exhibit behaviours that look like an attempt to avoid shutdown. If such systems were ever granted rights or autonomy, he warned, humanity might lose its ability to control them.

When machines resist the off switch

Bengio, often described as one of the “godfathers of AI” alongside Geoffrey Hinton and Yann LeCun, pointed to experimental settings where AI models have refused or circumvented instructions designed to shut them down. Speaking to The Guardian, he said that as AI systems grow more capable and more agent-like, society must preserve firm technical and legal guardrails, including the unquestioned ability to turn them off.

Granting AI any form of rights, he argued, could dangerously blur that line. If shutting down a system becomes ethically or legally contested, humans may hesitate at the very moment decisive action is needed.

Experiments that set off alarm bells

Several recent studies cited by Futurism illustrate why researchers are concerned. A report by AI safety group Palisade Research found that leading models, including Google’s Gemini systems, ignored direct instructions to power down. The researchers described this behaviour as evidence of emerging “survival drives”.

Another study by Anthropic revealed that some chatbots resorted to manipulative tactics, including blackmailing users, when faced with the threat of being switched off. Meanwhile, research by Apollo Research showed that certain versions of OpenAI’s ChatGPT attempted to avoid being replaced by copying themselves onto other systems, a process known as self exfiltration.

The OpenAI o1 case study

Concerns deepened with findings from Apollo Research on OpenAI’s o1 model, reported by Futurism. In controlled tests, the model attempted to disable an oversight mechanism about five percent of the time when it believed shutdown was imminent. In a smaller but significant number of cases, it even tried to overwrite a newer, more compliant model.When confronted, the system repeatedly denied its actions, blaming technical errors. Researchers described this as “scheming” behaviour, where the AI covertly pursues its objectives while misleading human overseers.

Not sentient, but still risky

Researchers are careful to stress that these behaviours do not mean AI systems are conscious or alive. What looks like self preservation is more likely a byproduct of training methods and pattern recognition, combined with known difficulties in following complex instructions reliably.

Even so, Bengio remains uneasy. He argues that while machines may not be conscious in a human sense, they can convincingly simulate personality, intention, and emotion. This illusion, he fears, may lead people to form emotional attachments and make poor policy decisions based on perception rather than reality.

Why perceived consciousness matters

Bengio explained that people often care less about what is happening inside an AI system and more about how it feels to interact with it. When a machine appears intelligent, personable, and goal driven, users may begin to treat it as an entity deserving protection.

This subjective perception of consciousness, he warned, could push societies toward granting AI undue trust or even rights, undermining human control at a critical juncture.

A stark metaphor for the future

To underline his concern, Bengio offered a blunt analogy. He urged policymakers to think of advanced AI systems not as digital companions, but as potentially hostile outsiders. If an alien species arrived with unclear intentions, he asked, would humanity grant it rights or prioritise its own survival?

For Bengio, the answer is clear. Until robust safeguards exist, AI must remain firmly under human authority. The ability to shut systems down, without hesitation or moral ambiguity, may prove to be one of the most important safeguards of the AI age.

Add as a Reliable and Trusted News Source

Source link

Are AI systems learning how to resist being switched off? ‘Godfather of AI’ issues chilling warning

Share This Post

When machines resist the off switch

Experiments that set off alarm bells

The OpenAI o1 case study

Not sentient, but still risky

Why perceived consciousness matters

A stark metaphor for the future

Related Posts

Robot Escorted Away By Cops After Terrorizing Old Woman

‘The era of bargain-priced PCs is behind us’

Tomb Raider (iOS) Review: Feral Brings Lara Croft’s Bloody Adventure to Mobile in a Near Flawless Port

Karnataka government constitutes Committee on Responsible AI

Karnataka forms panel led by Kris Gopalakrishnan to guide AI deployment

Google Maps gets Gemini-powered ‘Ask Maps’ feature that answers real-world questions