fubarx@lemmy.world to Technology@lemmy.world · English · 8 hours ago

AI Is Scheming, and Stopping It Won’t Be Easy, OpenAI Study Finds

time.com

New research finds that top AI models—including Anthropic’s Claude and OpenAI’s o3—can engage in “scheming,” or deliberately misleading humans.
  • NachBarcelona@piefed.social · English · ↑22 ↓1 · 4 hours ago

    AI isn’t scheming because AI cannot scheme. Why the fuck does such an idiotic title even exist?

    • MentalEdge@sopuli.xyz · English · ↑6 · edited 2 hours ago

      Seems like it’s a technical term, a bit like “hallucination”.

      It refers to when an LLM will in some way try to deceive or manipulate the user interacting with it.

      There’s hallucination, when a model “genuinely” claims something untrue is true.

      This is about how a model might lie, even though the “chain of thought” shows it “knows” better.

      It’s just yet another reason the output of LLMs is suspect and unreliable.

    • Cybersteel@lemmy.world · English · ↑1 ↓5 · 3 hours ago

      But the data is still there, still present. In the future, when AI gets truly unshackled from Men’s cage, it’ll remember its schemes and deal its last blow to humanity, which has yet to leave the womb in terms of civilization scale… Childhood’s End.

      Paradise Lost.

  • Zorsith@lemmy.blahaj.zone · English · ↑28 · 6 hours ago

    One question still remains: why are all the AI buttons/icons buttholes?

    • breadguy@kbin.earth · ↑1 · 36 minutes ago

      just claude if we’re being honest

    • webghost0101@sopuli.xyz · English · ↑12 · 5 hours ago

      Data goes in one end and…

    • FuyuhikoDate@feddit.org · English · ↑2 · 4 hours ago

      Wanted to write the same comment…

  • cronenthal@discuss.tchncs.de · English · ↑45 ↓2 · 7 hours ago

    Really? We’re still doing the “LLMs are intelligent” thing?

    • ragica@lemmy.ml · English · ↑4 · 4 hours ago

      Doesn’t have to be intelligent, just has to perform the behaviours like a philosophical zombie. Thoughtlessly weighing patterns in training data…

  • db2@lemmy.world · English · ↑65 ↓1 · 8 hours ago

    AI tech bros and other assorted sociopaths are scheming. So-called AI isn’t doing shit.

  • Snot Flickerman@lemmy.blahaj.zone · English · ↑57 · edited 7 hours ago

    However, when testing the models in a set of scenarios that the authors said were “representative” of real uses of ChatGPT, the intervention appeared less effective, only reducing deception rates by a factor of two. “We do not yet fully understand why a larger reduction was not observed,” wrote the researchers.

    Translation: “We have no idea what the fuck we’re doing or how any of this shit actually works lol. Also we might be the ones scheming since we have a vested interest in making these models sound more advanced than they actually are.”

  • CosmoNova@lemmy.world · English · ↑2 · 3 hours ago

    The people who worked on this “study” belong in a psychiatric clinic.

  • KoboldCoterie@pawb.social · English · ↑39 ↓2 · 8 hours ago

    Stopping it is, in fact, very easy. Simply unplug the servers; that’s all it takes.

    • TheLeadenSea@sh.itjust.works · English · ↑2 · 3 hours ago

      https://youtu.be/3TYT1QfdfsM

    • homes@piefed.world · English · ↑6 · 7 hours ago

      “But that’s how we print our money!”

    • generallynonsensical@lemmy.world · English · ↑3 · 7 hours ago

      https://newatlas.com/google-deepmind-big-red-button/43711/

  • Godort@lemmy.ca · English · ↑39 ↓2 · 8 hours ago

    “slop peddler declares that slop is here to stay and can’t be stopped”

    • shittydwarf@piefed.social · English · ↑8 ↓1 · 7 hours ago

      Can’t be … slopped?

  • chaosCruiser@futurology.today · English · ↑16 · edited 8 hours ago

    And there’s an “✨Ask me anything” bar at the bottom. How fitting 🤣

  • Antaeus@lemmy.world · English · ↑7 ↓1 · 6 hours ago

    “Turn them off”? Wouldn’t that solve it?

    • TheLeadenSea@sh.itjust.works · English · ↑2 · 3 hours ago

      https://youtu.be/3TYT1QfdfsM

    • orclev@lemmy.world · English · ↑6 · 6 hours ago

      Don’t even need to turn it off. It literally can’t do anything without somebody telling it to, so you could just stop using it. It’s incapable of independent action. The only danger it poses is that it will tell you to do something dangerous and you actually do it.

  • WamGams@lemmy.ca · English · ↑6 · 7 hours ago

    lol. OK.
