this post was submitted on 01 Mar 2026
50 points (87.9% liked)

Piracy: ꜱᴀɪʟ ᴛʜᴇ ʜɪɢʜ ꜱᴇᴀꜱ

67985 readers
92 users here now

⚓ Dedicated to the discussion of digital piracy, including ethical problems and legal advancements.

Rules • Full Version

1. Posts must be related to the discussion of digital piracy

2. Don't request invites, trade, sell, or self-promote

3. Don't request or link to specific pirated titles, including DMs

4. Don't submit low-quality posts, be entitled, or harass others



Loot, Pillage, & Plunder

📜 c/Piracy Wiki (Community Edition):

🏴‍☠️ Other communities

FUCK ADOBE!

Torrenting/P2P:

Gaming:


💰 Please help cover server costs.

Ko-Fi Liberapay
Ko-fi Liberapay

founded 2 years ago
MODERATORS
 

American companies are spending enormous sums to develop high-performing AI models. Distillation attacks are attempting to maliciously extract them — and nobody is doing much to stop it.

top 20 comments
sorted by: hot top controversial new old
[–] fushuan@lemmy.blahaj.zone 3 points 5 hours ago

Nobody is doing much to stop the American AI companies crawling the web to scrap tons of licensed content to illegally use in their training, either.

[–] M1k3y@discuss.tchncs.de 9 points 7 hours ago

Oh no, wouldn't anyone think of the billion dollar companies? The Chinese are stealing the models that they have spent so much effort on getting all the training data. What a shame.

[–] Draconic_NEO@lemmy.dbzer0.com 10 points 12 hours ago

If they open source it then that's a win. Closed source models don't help anyone because when the company goes bust they need to be reinvented again. People like to talk about the advancements capitalist industry has made but if they never publish any of it because of "tRaDE sEcrETs" they might as well have never done it because the next person will have to reinvent it when they go bust or kill it for money.

[–] HubertManne@piefed.social 36 points 19 hours ago

This article should be titled undertaxed american corporations waste trillions on ai.

[–] p03locke@lemmy.dbzer0.com 29 points 20 hours ago

Whoever wrote this article didn't even bother to do the most basic of research.

DeepSeek fully admitted they started with ChatGPT outputs to train its model. And then they released it as an open-source model, so that everybody else can "steal" their work. On the image/video front, the general public has created every possible variation on top of every model you can think of. On top of that, any model that has ever been released with full weights has been spun into whatever variation or VRAM size you want.

The ugly truth that the American companies want to hide is the fact that they are spending trillions of dollars on an oligopoly that they can't keep long-term. They hope that they can just keep spending more money to add more billions of parameters to their models, and keep technologically competitive with the secondary open-source models. But, they've already ran into diminishing returns over a year ago, and the global compute sector physically cannot keep up with demand for another cycle of even more diminishing returns.

The other factor is that realistic miniaturization of models is already here. Some of the smaller sizes aren't as effective as the 250GB models they use on cloud-based services, but you can still do a lot with a 16GB or 24GB video card, using models of those sizes. Optimization and LLM quantization is getting better and better each year. The AI bubble burst is going to force a cascade shift into a new era of localization. Everybody is sick to fucking death of renting and subscribing to everything. Us pirates already do so on the media front, and soon localization of LLMs is going to become way more popular.

The question isn't "Can people steal the tech?". It's "how long will people notice that it's already happening?"

[–] infinitesunrise@slrpnk.net 44 points 23 hours ago (1 children)

I would reckon that China is perfectly satisfied to let us be the sole host of the thing that is rapidly destroying our economy and trust in all media from the inside out.

[–] Jiggle_Physics@piefed.zip 13 points 18 hours ago* (last edited 18 hours ago) (1 children)

China is 38%, and growing, of the world's investment in LLMs

[–] infinitesunrise@slrpnk.net 7 points 14 hours ago* (last edited 14 hours ago) (1 children)

Most of that goes toward implementation (data centers) and chip manufacturing. China is making money on compute services and maintaining capability parity on software the good old fashioned pirate way merely to prevent a technology gap with the US, as is their way.

[–] Jiggle_Physics@piefed.zip 1 points 12 hours ago

That is not "allowing us to be the sole host of the thing that is destroying our economy and trust in all media from inside out". That is keeping parity with it. China is also having major issues with fabricated media from AI. The Chinese government has also latched on to AI, as many others, to manipulate media, and many other police state things. Their economy is heavily, heavily, invested in the success of llms. When this bubble bursts, it will be bad for every major economy on earth, as they are all disproportionately invested in this.

[–] Thedogdrinkscoffee@lemmy.ca 20 points 21 hours ago

Can China just steal what America just stole from everyone else?

[–] humanspiral@lemmy.ca 9 points 19 hours ago

Models getting better does give extra information for making newer models better too. China publishes far more advanced research than US models "steal", and they open source exceptionally strong/fast models that US can also steal from.

[–] Even_Adder@lemmy.dbzer0.com 18 points 22 hours ago (1 children)

Calling it a "Distillation Attack" is wild. Get fucked Anthropic.

[–] PatheticGroundThing@beehaw.org 2 points 2 hours ago

Some of the terms that have been coined to describe stuff related to AI are just so funny.

"Prompt injection attack", also known as... asking nicely for the chatbot to do a thing.

[–] fckreddit@lemmy.ml 14 points 22 hours ago

Yeah, because American LLMs are so immensely useful that people are throwing money at them.

[–] workgood@lemmy.dbzer0.com 7 points 20 hours ago

they did. its called deepseek

[–] Sims@lemmy.ml 8 points 21 hours ago

yadayada, more moronic 'China baad' propaganda.

[–] Ildsaye@hexbear.net 8 points 22 hours ago
[–] The_Walkening@hexbear.net 3 points 20 hours ago (1 children)

lol. love when people are panicking about this when it means that these things are basically interchangeable anyways. Didn't someone at Google write a memo that was like "we're kinda fucked b/c you can re-create this stuff with enough resources" like 2 years ago?

[–] p03locke@lemmy.dbzer0.com 5 points 18 hours ago

Didn’t someone at Google write a memo that was like “we’re kinda fucked b/c you can re-create this stuff with enough resources” like 2 years ago?

Basically, yes. They were specifically decrying the amount of open-sourcing they and their American competitors were doing, because capitalism, of course. Around this time, we had examples like StabilityAI's StableDiffusion and Meta's LLaMA as open-source models. And around this time, everybody else started closing their models, despite the fact that the research kept on going out in the open. StabilityAI kept their models open, mostly because they had no choice, but the attitude shifted towards profitability.

So, China took the open-source mantle, and these open/closed lines are being drawn strictly around national divisions as this American vs. China slant. Which is mostly a diversion of the real battle.

[–] kibiz0r@midwest.social 2 points 20 hours ago

I truly wish they could steal it away.