• Null User Object@lemmy.world
    link
    fedilink
    English
    arrow-up
    222
    ·
    11 months ago

    threatens to “financially ruin” the entire AI industry

    No. Just the LLM industry and AI slop image and video generation industries. All of the legitimate uses of AI (drug discovery, finding solar panel improvements, self driving vehicles, etc) are all completely immune from this lawsuit, because they’re not dependent on stealing other people’s work.

    • A Wild Mimic appears!@lemmy.dbzer0.com
      link
      fedilink
      English
      arrow-up
      26
      ·
      11 months ago

      But it would also mean that the Internet Archive is illegal, even tho they don’t profit, but if scraping the internet is a copyright violation, then they are as guilty as Anthropic.

      • magikmw@piefed.social
        link
        fedilink
        English
        arrow-up
        27
        ·
        11 months ago

        IA doesn’t make any money off the content. Not that LLM companies do, but that’s what they’d want.

        • axmo@lemmy.ca
          link
          fedilink
          English
          arrow-up
          17
          ·
          11 months ago

          Profit (or even revenue) is not required for it to be considered an infringement, in the current legal framework.

        • CosmoNova@lemmy.world
          link
          fedilink
          English
          arrow-up
          2
          ·
          11 months ago

          And this is exactly the reason why I think the IA will be forced to close down while AI companies that trained their models on it will not only stay but be praised for preserving information in an ironic twist. Because one side does participate in capitalism and the other doesn’t. They will claim AI is transformative enough even when it isn’t because the overly rich invested too much money into the grift.

      • carg@feddit.org
        link
        fedilink
        English
        arrow-up
        1
        ·
        11 months ago

        Scrapping the Internet is not illegal. All AI companies did much more beyond that, they accessed private writings, private code, copyrighted images. they scanned copyrighted books (and then destroyed them), downloaded terabytes of copyrighted torrents … etc

        So, the message is like piracy is OK when it’s done massively by a big company. They’re claiming “fair use” and most judges are buying it (or being bought?)

  • halcyoncmdr@lemmy.world
    link
    fedilink
    English
    arrow-up
    135
    ·
    11 months ago

    As Anthropic argued, it now “faces hundreds of billions of dollars in potential damages liability at trial in four months” based on a class certification rushed at “warp speed” that involves “up to seven million potential claimants, whose works span a century of publishing history,” each possibly triggering a $150,000 fine.

    So you knew what stealing the copyrighted works could result in, and your defense is that you stole too much? That’s not how that works.

    • zlatko@programming.dev
      link
      fedilink
      English
      arrow-up
      39
      ·
      11 months ago

      Actually that usually is how it works. Unfortunately.

      *Too big to fail" was probably made up by the big ones.

      • Signtist@bookwyr.me
        link
        fedilink
        English
        arrow-up
        8
        ·
        11 months ago

        This is the real concern. Copyright abuse has been rampant for a long time, and the only reason things like the Internet Archive are allowed to exist is because the copyright holders don’t want to pick a fight they could potentially lose and lessen their hold on the IPs they’re hoarding. The AI case is the perfect thing for them, because it’s a very clear violation with a good amount of public support on their side, and winning will allow them to crack down even harder on all the things like the Internet Archive that should be fair use. AI is bad, but this fight won’t benefit the public either way.

        • A Wild Mimic appears!@lemmy.dbzer0.com
          link
          fedilink
          English
          arrow-up
          4
          ·
          11 months ago

          I wouldn’t even say AI is bad, i have currently Qwen 3 running on my own GPU giving me a course in RegEx and how to use it. It sometimes makes mistakes in the examples (we all know that chatbots are shit when it comes to the r’s in strawberry), but i see it as “spot the error” type of training for me, and the instructions themself have been error free for now, since i do the lesson myself i can easily spot if something goes wrong.

          AI crammed into everything because venture capitalists try to see what sticks is probably the main reason public opinion of chatbots is bad, and i don’t condone that too, but the technology itself has uses and is an impressive accomplishment.

          Same with image generation: i am shit at drawing, and i don’t have the money to commission art if i want something specific, but i can generate what i want for myself.

          If the copyright side wins, we all might lose the option to run imagegen and llms on our own hardware, there will never be an open-source llm, and resources that are important to us all will come even more under fire than they are already. Copyright holders will be the new AI companies, and without competition the enshittification will instantly start.

          • Signtist@bookwyr.me
            link
            fedilink
            English
            arrow-up
            2
            ·
            11 months ago

            What you see as “spot the error” type training, another person sees as absolute fact that they internalize and use to make decisions that impact the world. The internet gave rise to the golden age of conspiracy theories, which is having a major impact on the worsening political climate, and it’s because the average user isn’t able to differentiate information from disinformation. AI chatbots giving people the answer they’re looking for rather than the truth is only going to compound the issue.

            • A Wild Mimic appears!@lemmy.dbzer0.com
              link
              fedilink
              English
              arrow-up
              1
              ·
              11 months ago

              I agree that this has to become better in the future, but the technology is pretty young, and i am pretty sure that fixing this stuff has a high priority in those companies - it’s bad PR for them. But the people are already gorging themselves on faulty info per social media - i don’t see that chatbots are making this really worse than it already is.

  • chaosCruiser@futurology.today
    link
    fedilink
    English
    arrow-up
    64
    ·
    11 months ago

    Oh no! Building a product with stolen data was a rotten idea after all. Well, at least the AI companies can use their fabulously genius PhD level LLMs to weasel their way out of all these lawsuits. Right?

    • Rooskie91@discuss.online
      link
      fedilink
      English
      arrow-up
      45
      ·
      11 months ago

      I propose that anyone defending themselves in court over AI stealing data must be represented exclusively by AI.

        • BakerBagel@midwest.social
          link
          fedilink
          English
          arrow-up
          4
          ·
          11 months ago

          “ooh, so sorry, but your LLM was trained on proprietary documents stolen from several major law firms, and they are all suing you now”

      • chaosCruiser@futurology.today
        link
        fedilink
        English
        arrow-up
        1
        ·
        11 months ago

        That would be glorious. If the future of your company depends on the LLM keeping track of hundreds of details and drawing the right conclusions, it’s game over during the first day.

    • thesohoriots@lemmy.world
      link
      fedilink
      English
      arrow-up
      9
      ·
      11 months ago

      PhD level LLM = paying MAs $21/hr to write summaries of paragraphs for them to improve off of. Google Gemini outsourced their work like this, so I assume everyone else did too.

  • PushButton@lemmy.world
    link
    fedilink
    English
    arrow-up
    48
    ·
    11 months ago

    Let’s go baby! The law is the law, and it applies to everybody

    If the “genie doesn’t go back in the bottle”, make him pay for what he’s stealing.

    • Zetta@mander.xyz
      link
      fedilink
      English
      arrow-up
      32
      ·
      11 months ago

      The law absolutely does not apply to everybody, and you are well aware of that.

      • BussyCat@lemmy.world
        link
        fedilink
        English
        arrow-up
        7
        ·
        11 months ago

        It’s not because they would only train on things they own which is an absolute tiny fraction of everything that everyone owns. It’s like complaining that a rich person gets to enjoy their lavish estate when the alternative is they get to use everybody’s home in the world.

          • BussyCat@lemmy.world
            link
            fedilink
            English
            arrow-up
            5
            ·
            11 months ago

            They have 0.2T in assets the world has around 660T in assets which as I said before is a tiny fraction. Obviously both hold a lot of assets that aren’t worthwhile to AI training such as theme parks but when you consider a single movie that might be worth millions or billions has the same benefit for AI training as another movie worth thousands. the amount of assets Disney owned is not nearly as relevant as you are making it out to be

  • [object Object]@lemmy.ca
    link
    fedilink
    English
    arrow-up
    36
    ·
    11 months ago

    Well maybe they shouldn’t have done of the largest violations of copyright and intellectual property ever.

    Probably the largest single instance ever.

    • potoooooooo 🥔@lemmy.world
      link
      fedilink
      English
      arrow-up
      9
      ·
      11 months ago

      I feel like it can’t even be close. What would even compete? I know I’ve gone a little overboard with my external hard drive, but I don’t think even I’m to that level.

  • westingham@sh.itjust.works
    link
    fedilink
    English
    arrow-up
    34
    ·
    11 months ago

    I was reading the article and thinking “suck a dick, AI companies” but then it mentions the EFF and ALA filed against the class action. I have found those organizations to be generally reputable and on the right side of history, so now I’m wondering what the problem is.

      • Jason2357@lemmy.ca
        link
        fedilink
        English
        arrow-up
        13
        ·
        11 months ago

        Take scraping. Companies like Clearview will tell you that scraping is legal under copyright law. They’ll tell you that training a model with scraped data is also not a copyright infringement. They’re right.

        I love Cory’s writing, but while he does a masterful job of defending scraping, and makes a good argument that in most cases, it’s laws other than Copyright that should be the battleground, he does, kinda, trip over the main point.

        That is that training models on creative works and then selling access to the derivative “creative” works that those models output very much falls within the domain of copyright - on either side of a grey line we usually call “fair use” that hasn’t been really tested in courts.

        Lets take two absurd extremes to make the point. Say I train an LLM directly on Marvel movies, and then sell movies (or maybe movie scripts) that are almost identical to existing Marvel movies (maybe with a few key names and features altered). I don’t think anyone would argue that is not a derivative work, or that falls under “fair use.” However, if I used literature to train my LLM to be able to read, and used that to read street signs for my self-driving car, well, yeah, that might be something you could argue is “fair use” to sell. It’s not producing copy-cat literature.

        I agree with Cory that scraping, per se, is absolutely fine, and even re-distributing the results in some ways that are in the public interest or fall under “fair use”, but it’s hard to justify the slop machines as not a copyright problem.

        In the end, yeah, fuck both sides anyway. Copyright was extended too far and used for far too much, and the AI companies are absolute thieves. I have no illusions this type of court case will do anything more than shift wealth from one robber-barron to another, and won’t help artists and authors.

        • kibiz0r@midwest.social
          link
          fedilink
          English
          arrow-up
          4
          ·
          11 months ago

          I agree, and I think your points line up with Doctorow’s other writing on the subject. It’s just hard to cover everything in one short essay.

    • peoplebeproblems@midwest.social
      link
      fedilink
      English
      arrow-up
      8
      ·
      11 months ago

      I disagree with the EFF and ALA on this one.

      These were entire sets of writing consumed and reworked into poor data without respecting the license to them.

      Honestly, I wouldn’t be surprised if copyright wasn’t the only thing to be the problem here, but intellectual property as well. In that case, EFF probably has an interest in that instead. Regardless, I really think it need to be brought through court.

      LLMs are harmful, full stop. Most other Machine Learning mechanisms use licensed data to train. In the case of software as a medical device, such as image analysis AI, that data is protected by HIPPA and special attention is already placed in order to utilize it.

      • vala@lemmy.dbzer0.com
        link
        fedilink
        English
        arrow-up
        1
        ·
        11 months ago

        My guess is that the EFF is mostly concerned with the fact this is a class action and also worried about expanding copyright in general.

    • pelya@lemmy.world
      cake
      link
      fedilink
      English
      arrow-up
      6
      ·
      11 months ago

      AI coding tools are using the exact same backends as AI fiction writing tools, so it would hurt the fledgling vibe coder profession (which according to proper software developers should not be allowed to exist at all).

  • guyincognito@piefed.social
    link
    fedilink
    English
    arrow-up
    29
    ·
    11 months ago

    I thought it was hilarious how there was a quote in the article that said

    immense harm not only to a single AI company, but to the entire fledgling AI industry and to America’s global technological competitiveness

    It will only do this because all these idiotic American companies fired all their employees to replace them with AI. Hire then back and the edge won’t dull. But we all know that they won’t do this and just cry and point fingers wondering how they ever lost a technology race.

    Edited because it’s my first time using quotes and I don’t know how to use them properly haha

  • Deflated0ne@lemmy.world
    link
    fedilink
    English
    arrow-up
    28
    ·
    11 months ago

    Good. Burn it down. Bankrupt them.

    If it’s so “critical to national security” then nationalize it.

    • A Wild Mimic appears!@lemmy.dbzer0.com
      link
      fedilink
      English
      arrow-up
      9
      ·
      11 months ago

      the “burn it down” variant would only lead to the scenario where the copyright holders become the AI companies, since they have the content to train it. AI will not go away, it might change ownership to someone worse tho.

      nationalizing sounds better; even better were to put in under UNESCO-stewardship.

  • 9point6@lemmy.world
    link
    fedilink
    English
    arrow-up
    27
    ·
    11 months ago

    Probably would have been cheaper to license everything you stole, eh, Anthropic?

    • Modern_medicine_isnt@lemmy.world
      link
      fedilink
      English
      arrow-up
      5
      ·
      11 months ago

      This issue is not so cut and dry. The AI companies are stealing from other companies more than ftom individual people. Publishing companies are owned by some very rich people. And they want thier cut.

      This case may have started out with authors, but it is mentioned that it could turn into publishing companies vs AI companies.

  • Treczoks@lemmy.world
    link
    fedilink
    English
    arrow-up
    24
    ·
    11 months ago

    Well, theft has never been the best foundation for a business, has it?

    While I completely agree that copyright terms are completely overblown, they are valid law that other people suffer under, so it is 100% fair to make them suffer the same. Or worse, as they all broke the law for commercial gain.

    • No1@aussie.zone
      link
      fedilink
      English
      arrow-up
      1
      ·
      11 months ago

      Well, theft has never been the best foundation for a business, has it?

      History would suggest otherwise.

  • Lexam@lemmy.world
    link
    fedilink
    English
    arrow-up
    18
    ·
    11 months ago

    No it won’t. Just their companies. Which are the ones making slop. If your AI does something actually useful it will survive.

  • arararagi@ani.social
    link
    fedilink
    English
    arrow-up
    15
    ·
    11 months ago

    Meanwhile some Italian YouTuber was raided because some portable consoles already came with roms in their memory, they only go after individuals.

  • WereCat@lemmy.world
    link
    fedilink
    English
    arrow-up
    13
    ·
    11 months ago

    We just need to show that ChatGPT and alike can generate Nintendo based content and let it fight out between them