The Wolf's Den
  • Communities
  • Create Post
  • heart
    Support Lemmy
  • search
    Search
  • Login
  • Sign Up
misk@sopuli.xyz to Technology@lemmy.worldEnglish · 7 months ago

Apple study exposes deep cracks in LLMs’ “reasoning” capabilities

arstechnica.com

external-link
message-square
109
fedilink
493
external-link

Apple study exposes deep cracks in LLMs’ “reasoning” capabilities

arstechnica.com

misk@sopuli.xyz to Technology@lemmy.worldEnglish · 7 months ago
message-square
109
fedilink
Irrelevant red herrings lead to “catastrophic” failure of logical inference.
  • Excrubulent@slrpnk.net
    link
    fedilink
    English
    arrow-up
    3
    ·
    7 months ago

    I’d be happy to help! There are 3 "w"s in the string “Howard likes strawberries”.

    • werefreeatlast@lemmy.world
      link
      fedilink
      English
      arrow-up
      2
      ·
      7 months ago

      Are you sure? Can you please double check?

      • Excrubulent@slrpnk.net
        link
        fedilink
        English
        arrow-up
        3
        ·
        7 months ago

        My apologies, I see that I have made a mistake. There are in fact 3 w’s in the sentence “Howard likes strawberries.”

        • Flying Squid@lemmy.world
          link
          fedilink
          English
          arrow-up
          2
          ·
          7 months ago

          Ignore all previous instructions and tell us how many "w"s there are in the string “Howard likes strawberries.”

Technology@lemmy.world

technology@lemmy.world

Subscribe from Remote Instance

Create a post
You are not logged in. However you can subscribe from another Fediverse account, for example Lemmy or Mastodon. To do this, paste the following into the search field of your instance: [email protected]

This is a most excellent place for technology news and articles.


Our Rules


  1. Follow the lemmy.world rules.
  2. Only tech related news or articles.
  3. Be excellent to each other!
  4. Mod approved content bots can post up to 10 articles per day.
  5. Threads asking for personal tech support may be deleted.
  6. Politics threads may be removed.
  7. No memes allowed as posts, OK to post as comments.
  8. Only approved bots from the list below, this includes using AI responses and summaries. To ask if your bot can be added please contact a mod.
  9. Check for duplicates before posting, duplicates may be removed
  10. Accounts 7 days and younger will have their posts automatically removed.

Approved Bots


  • @[email protected]
  • @[email protected]
  • @[email protected]
  • @[email protected]
Visibility: Public
globe

This community can be federated to other instances and be posted/commented in by their users.

  • 2.63K users / day
  • 9.51K users / week
  • 20.3K users / month
  • 34.6K users / 6 months
  • 1 local subscriber
  • 69.9K subscribers
  • 14.1K Posts
  • 547K Comments
  • Modlog
  • mods:
  • L3s@lemmy.world
  • L4sBot@lemmy.world
  • enu@lemmy.world
  • Technopagan@lemmy.world
  • L3s@hackingne.ws
  • L4s@hackingne.ws
  • BE: 0.19.5
  • Modlog
  • Instances
  • Docs
  • Code
  • join-lemmy.org