• Snot Flickerman@lemmy.blahaj.zone
    link
    fedilink
    English
    arrow-up
    97
    arrow-down
    1
    ·
    edit-2
    9 months ago

    OpenAI wants to eat everyone’s lunch. They don’t even respect robots.txt.

    I’m gonna be real, I don’t think an AI company stealing every bit of data I ever put online to ingest into their AI so they can use it to make money really falls under “fair use.”

    If it were individuals, who were doing it for shits and giggles, and not to make a profit, I’d have a different view. Because I kind of don’t care if you’re just some random asshat making an art project and aren’t using it to try make beaucoup bucks on the back of the whole planets intellectual labor.

    Fair Use should be for humans not corporations. Fuck this “corporations are people too” schtick, they aren’t and the fact that they’re taking this data and using it to profit really should be viewed no differently than the argument that people made about The Pirate Bay, that through piracy, they were making money, which is what made it wrong.

    If we keep allowing the rich to just steal from the public commons because of dumb shit like corporations having the same rights as people, we’re barrelling headlong into a Cyberpunk Dystopia, if we’re not there already.

    Humans who are creating art and culture should have rights to Fair Use, not giant fucking corporations using it to profit wildly at the public expense. Copyright is all fucked up and broken, and this is a great example of why, because we’re aiming provisions that were meant to protect the public are now aimed at protecting corporations profits.

    • QuandaleDingle@lemmy.world
      link
      fedilink
      English
      arrow-up
      20
      arrow-down
      1
      ·
      edit-2
      9 months ago

      This is all exactly what I’ve been thinking, and better parsed too. If “we” don’t guide AI with the law now, we’re screwed.

      • umbrella@lemmy.ml
        link
        fedilink
        English
        arrow-up
        10
        ·
        9 months ago

        good luck trying to convince our capitalist overlords to stop making money.

        we need more than laws yo

        • GBU_28@lemm.ee
          link
          fedilink
          English
          arrow-up
          5
          ·
          9 months ago

          It’s not even that the lawmakers are capitalists, it’s that they are ancient, and have no clue about this. There are sitting congresspeople who proudly say they’ve never sent an email.

          • umbrella@lemmy.ml
            link
            fedilink
            English
            arrow-up
            1
            ·
            9 months ago

            I strongly doubt legislators are innocent old men, just caught off guard by tech, even if some might. They know what it is and are trying to figure out how to control it.

            And the lawmakers don’t even need to be capitalists themselves (even though they mostly are), they just need to be in their pockets. Its not a coincidence their legislation is almost always beneficial to corporations one way or the other.

            Hell, in the US lobbying is legal, its not even a secret.

            • GBU_28@lemm.ee
              link
              fedilink
              English
              arrow-up
              2
              ·
              9 months ago

              I never called anyone innocent and never implied it. I was highlighting the true issue is that our whole legislature is completely outclassed by the progress of tech.

    • Fisch@lemmy.ml
      link
      fedilink
      English
      arrow-up
      4
      ·
      9 months ago

      I don’t like corporations using the data I put online to train their AI either. I’d happily give it to people to train a FOSS AI tho. I’ve also contributed my voice to Mozillas Common Voice project, which is an open dataset that everyone is allowed to use. If it’s something that everyone can use and benefit from equally, I’m happy to help. I’m not happy to help some corporation make even more profits tho. At least not without getting payed.

  • AVincentInSpace@pawb.social
    link
    fedilink
    English
    arrow-up
    24
    ·
    edit-2
    9 months ago

    Yeah, they’ve been wanting that for a while. Microsoft and Google both already have AI search products, and the only thing we’ve really learned from those are that AI search products don’t work.

    Also

    Altman is Google’s nightmare that it can’t wake up from.

    Holy mother of dickriding. This article mainly just seems like a list of statements for AI tech bros to parrot to each other.

    • amzd@kbin.social
      link
      fedilink
      arrow-up
      7
      arrow-down
      1
      ·
      9 months ago

      ai search products don’t work

      Idk how you got to that conclusion. I’m trying to change my habit of googling coding problems because phind.com will 9/10 times have an answer faster and more accurate than if I googled it.

  • hperrin@lemmy.world
    link
    fedilink
    English
    arrow-up
    16
    ·
    9 months ago

    I can’t imagine that whatever they build will be any worse than Google Search.

  • nucleative@lemmy.world
    link
    fedilink
    English
    arrow-up
    14
    ·
    9 months ago

    OpenAI’s existential problem is that they’ll eat their own lunch and then have nothing left. The reason people make useful content now and give it away for free is because they can get paid for the traffic.

    Take that traffic away and all the content goes behind paywalls and login screens where OpenAI can’t touch it.

    • sunbeam60@lemmy.one
      link
      fedilink
      English
      arrow-up
      6
      arrow-down
      1
      ·
      9 months ago

      But the content has already been absorbed. I wouldn’t be surprised if they have all of it sucked up (many would argue illegally) and stored as a corpus for them to iterate onto. It’s not like they go out to touch all the web every time they train a new version of their model.

        • QuaternionsRock@lemmy.world
          link
          fedilink
          English
          arrow-up
          1
          ·
          9 months ago

          One of the craziest facts about GPT (to me) is that it was trained on 570GB of text data. That’s obviously a lot of text, but it’s bewildering to me that I could theoretically store their entire training dataset on my laptop.

    • GBU_28@lemm.ee
      link
      fedilink
      English
      arrow-up
      4
      ·
      9 months ago

      Lol people will literally bring openai with them past paywalls and logins.

      • Fisch@lemmy.ml
        link
        fedilink
        English
        arrow-up
        1
        ·
        9 months ago

        Exactly, not getting money from almost all visits is still better than not getting any visits

  • GlitzyArmrest@lemmy.world
    link
    fedilink
    English
    arrow-up
    11
    ·
    9 months ago

    If they eventually do have a search engine, it definitely can’t be trusted. GPT-4 loves to hallucinate.

  • jdeath@lemm.ee
    link
    fedilink
    English
    arrow-up
    5
    arrow-down
    1
    ·
    9 months ago

    awesome. google ruined the internet, would be great for them to face an existential threat

  • AutoTL;DR@lemmings.worldB
    link
    fedilink
    English
    arrow-up
    4
    ·
    9 months ago

    This is the best summary I could come up with:


    The AI search engine could be a new feature for ChatGPT, or a potentially separate app altogether.

    Microsoft Bing would allegedly power the service from Sam Altman, which could be the most serious threat Google Search has ever faced.

    The report says this new search product could be faster than ChatGPT, without sacrificing its powerful summarizing abilities.

    Now, OpenAI is coming directly for the crown, in an attempt to dethrone the most dominant internet service of all time.

    Perplexity, a small AI search startup with backing from Jeff Bezos, is also competing for Google’s crown.

    If such a small team can steal 10 million people from Google, imagine what OpenAI and Microsoft can do.


    The original article contains 405 words, the summary contains 114 words. Saved 72%. I’m a bot and I’m open source!