• 0 Posts
  • 26 Comments
Joined 7 months ago
cake
Cake day: December 5th, 2023

help-circle
  • A lot of this stems from instances running old versions with loose registration requirements, like no captcha. This is a problem in a federated system because there’s no barrier for a banned user to just jump to another instance.

    Perhaps it would be a good idea if, when Lemmy has anti-spam measures implemented like rate-limiting and captchas for registration, it disabled federation with instances that are at a lower version, to motivate small instances to upgrade and enable the new features.







  • We’re already at that point. Even recipe sites, which I’ll give the benefit of assuming aren’t already ML-generated, are already so similar, boring, and irrelevant that nobody reads them.

    In the past few months, I’ve also noticed a lot of sites showing up in my Google search results purporting to be relevant or answer my question, but when I actually read them they are also completely useless. For example, I couldn’t figure out how to take a friend’s Instagram story and reshare it to my own if I wasn’t tagged in it. Several pages were titled to look useful, but all of them gave only alternatives.













  • copyright only protects them from people republishing their content

    This is not correct. Copyright protects reproduction, derivation, distribution, performance, and display of a work.

    People also ingest their content and can make derivative works without problem. OpenAI are just doing the same, but at a level of ability that could be disruptive to some companies.

    Yes, you can legally make derivative works, but without license, it has to be fair use. In this case, where not only did they use one whole work in its entirety, they likely scraped thousands of whole NYT articles.

    This isn’t even really very harmful to the NYT, since the historical material used doesn’t even conflict with their primary purpose of producing new news.

    This isn’t necessarily correct either. I assume they sell access to their archives, for research or whatever. Being able to retrieve articles verbatim through chatgpt does harm their business.