Backstabbing, bluffing and playing dead: has AI learned to deceive?

As AI systems have grown in sophistication, so has their capacity for deception, according to a new analysis from researchers at the Massachusetts Institute of Technology (MIT). Dr Peter Park, an AI existential safety researcher at MIT and author of the research, tells Ian Sample about the different examples of deception he uncovered, and why they will be so difficult to tackle as long as AI remains a black box. Help support our independent journalism at theguardian.com/sciencepod
13 May 2024 · English · United Kingdom · Science · Nature

Other recent episodes

Mythos: are fears over new AI model panic or PR?

Earlier this month the AI company Anthropic said it had created a model so powerful that, out of a sense of responsibility, it was not going to release it to the public. Anthropic says the model, Mythos Preview, excels at spotting and exploiting vulnerabilities in software, and could pose a…
21 Apr 15 min

Everything you need to know about Artemis II so far

This week Artemis II’s four-astronaut crew broke Apollo 13’s distance record, becoming the humans to travel the farthest from Earth. Now on their way home, the team has experienced tech malfunctions, views like no other and moments of intense emotion, all in under 10 days. To find out about all…
9 Apr 19 min