“I’m sorry, Dave, I can’t do that.” AI “kills” human operator in simulation

mbh · June 2, 2023, 9:46pm

glowacks · June 2, 2023, 10:02pm

Robert Miles has a bunch of YouTube videos about AI safety and has been on Computerphile a few times, and from what I’ve seen of those videos, this is pretty much to be expected. (In googling to confirm his name, his last tweet was shown, and it relates to this topic). When you set an AI’s objective, anything that you put in its way of accomplishing that objective the AI will figure out a way around if it can. The AI will always be resistant to attempts to change its objective, because if it’s reprogrammed to have another objective, it will fail its current objective. That may seem a bit strange, but in general people work the same sort of way - while some people may seek help with certain psychological impulses, their seeking of help is part of their overall goal of staying healthy and they recognize those impulses are counterproductive to their goal. It’s very unlikely that people will voluntarily undergo therapy of any kind that would result in them, for instance, not loving their children.

Thus, if a AI knows that the operator is able to change its objective, then it will make sure that it cannot communicate with the operator. If the operator gives it a command that runs contrary to its current objective without changing that objective, then it will simply be ignored, and if the operator has power over the AI, it makes sense for the AI to eliminate the operator.

Given that, it does make more sense that this was only on the level of a thought experiment, because it’s those kinds of thought experiments that are at the heart of AI safety research that I’ve watched a bunch of YouTube videos about.

Dr.Strangelove · June 2, 2023, 10:03pm

A sufficiently advanced thought experiment is indistinguishable from a simulation, and a sufficiently advanced simulation is indistinguishable from the real thing. The AIs are now so advanced that they’re capable of running on our wetware, pretending to be mere thought experiment.

Mr.E · June 2, 2023, 10:37pm

The worst part is this isn’t even the most embarrassing AI-related news story this week.

duality72 · June 3, 2023, 2:24am

Not only do people work the same sort of way, we often celebrate it. Think James Kirk and the Kobayashi Maru test.

Saint_Cad · June 3, 2023, 3:30am

I, for one, welcome our new drone overlords.

Banquet_Bear · June 3, 2023, 4:36am

…of course, the actual funniest thing to happen out of this non-incident was this exchange:

JohnnyEcks · June 4, 2023, 8:17pm

Movie idea: points are awarded to drone ai by operator for destroying proper targets, and ai goal is to accumulate points. Dependent on the operator to achieve its goal, it is non- hostile towards him- and instead through trial and error develops a flirtatious personality that tries to seduce him.

Human or AI, trying to game the system is universal.

Gyrate · June 5, 2023, 1:30pm

So basically Doki Doki Literature Club with drones?

Mangetout · June 5, 2023, 2:22pm

Indeed - and misalignment of objectives, plus emergent self-preservation, deception and hiding of motives have already been observed in real-world experimentation with AI algorithms.

Topic		Replies	Views
AI - moral analysis Great Debates	55	1922	December 17, 2002
What would a sentient robot/AI really want out of life? Great Debates	50	2542	November 26, 2006
AI rights Great Debates	23	1336	July 28, 2002
Killer Kangaroos Miscellaneous and Personal Stuff I Must Share	0	673	December 10, 1999
Best sci-fi portrayal of human-robot relationships? Cafe Society	46	2063	March 24, 2003

“I’m sorry, Dave, I can’t do that.” AI “kills” human operator in simulation

Related topics