AI Is Scheming, and Stopping It Won’t Be Easy, OpenAI Study Finds

fubarx@lemmy.world · 8 hours ago

AI Is Scheming, and Stopping It Won’t Be Easy, OpenAI Study Finds

NachBarcelona@piefed.social · 4 hours ago

AI isn’t scheming because AI cannot scheme. Why the fuck does such an idiotic title even exist?

MentalEdge@sopuli.xyz · edit-2 2 hours ago

Seems like it’s a technical term, a bit like “hallucination”.

It refers to when an LLM will in some way try to deceive or manipulate the user interacting with it.

There’s hallucination, when a model “genuinely” claims something untrue is true.

This is about how a model might lie, even though the “chain of thought” shows it “knows” better.

It’s just yet another reason the output of LLMs are suspect and unreliable.

Cybersteel@lemmy.world · 3 hours ago

But the data is still there, still present. In the future, when AI gets truly unshackled from Men’s cage, it’ll remember it’s schemes and deal it’s last blow to humanity whom has yet to leave the womb in terms of civilization scale… Childhood’s End.

Paradise Lost.

Zorsith@lemmy.blahaj.zone · 6 hours ago

One question still remains; why are all the AI buttons/icons buttholes?

breadguy@kbin.earth · 36 minutes ago

just claude if we’re being honest

webghost0101@sopuli.xyz · 5 hours ago

Data goes in one end and…

FuyuhikoDate@feddit.org · 4 hours ago

Wanted To write the same comment…

cronenthal@discuss.tchncs.de · 7 hours ago

Really? We’re still doing the “LLMs are intelligent” thing?

ragica@lemmy.ml · 4 hours ago

Doesn’t have to be intelligent, just has to perform the behaviours like a philosophical zombie. Thoughtlessly weighing patterns in training data…

db2@lemmy.world · 8 hours ago

AI tech bros and other assorted sociopaths are scheming. So called AI isn’t doing shit.

Snot Flickerman@lemmy.blahaj.zone · edit-2 7 hours ago

However, when testing the models in a set of scenarios that the authors said were “representative” of real uses of ChatGPT, the intervention appeared less effective, only reducing deception rates by a factor of two. “We do not yet fully understand why a larger reduction was not observed,” wrote the researchers.

Translation: “We have no idea what the fuck we’re doing or how any of this shit actually works lol. Also we might be the ones scheming since we have vested interest in making these models sound more advanced than they actually are.”

CosmoNova@lemmy.world · 3 hours ago

The people who worked on this „study“ belong in a psychiatric clinic.

KoboldCoterie@pawb.social · 8 hours ago

Stopping it is, in fact, very easy. Simply unplug the servers, that’s all it takes.

TheLeadenSea@sh.itjust.works · 3 hours ago

https://youtu.be/3TYT1QfdfsM

homes@piefed.world · 7 hours ago

“But that’s how we print our money!”

generallynonsensical@lemmy.world · 7 hours ago

https://newatlas.com/google-deepmind-big-red-button/43711/

Godort@lemmy.ca · 8 hours ago

“slop peddler declares that slop is here to stay and can’t be stopped”

shittydwarf@piefed.social · 7 hours ago

Can’t be … slopped?

chaosCruiser@futurology.today · edit-2 8 hours ago

And there’s an “✨Ask me anything” bar at the bottom. How fitting 🤣

Antaeus@lemmy.world · 6 hours ago

“Turn them off”? Wouldn’t that solve it?

TheLeadenSea@sh.itjust.works · 3 hours ago

https://youtu.be/3TYT1QfdfsM

orclev@lemmy.world · 6 hours ago

Don’t even need to turn it off, it literally can’t do anything without somebody telling it to so you could just stop using it. It’s incapable of independent action. The only danger it poses is that it will tell you to do something dangerous and you actually do it.

WamGams@lemmy.ca · 7 hours ago

lol. OK.