this post was submitted on 08 May 2026
63 points (97.0% liked)

Piracy: ꜱᴀɪʟ ᴛʜᴇ ʜɪɢʜ ꜱᴇᴀꜱ

69194 readers
150 users here now

⚓ Dedicated to the discussion of digital piracy, including ethical problems and legal advancements.

Rules • Full Version

1. Posts must be related to the discussion of digital piracy

2. Don't request invites, trade, sell, or self-promote

3. Don't request or link to specific pirated titles, including DMs

4. Don't submit low-quality posts, be entitled, or harass others



Loot, Pillage, & Plunder

We heartily recommend visiting the free port of freemediaheckyeah (aka FMHY) while you sail the high seas, for all the freshest links the ocean has to offer.

📜 c/Piracy Wiki (Community Edition):

🏴‍☠️ Other communities

FUCK ADOBE!

Torrenting/P2P:

Gaming:


💰 Please help cover server costs.

Ko-Fi Liberapay
Ko-fi Liberapay

founded 2 years ago
MODERATORS
top 8 comments
sorted by: hot top controversial new old
[–] SnoringEarthworm@sh.itjust.works 39 points 1 week ago (1 children)

Besides selling the most sought-after hardware, NVIDIA is also developing its own models, including NeMo Megatron models. These were trained using NVIDIA’s own hardware and with help from large text libraries, much like other tech giants do.

...

As the case progressed, the authors also brought up NVIDIA’s contacts with Anna’s Archive, inquiring about “high-speed access” to the shadow library’s massive collection of pirated books.

This is probably why Anna's Archive hasn't been taken down yet - the big fish are pirating, too.

[–] Grumpus_Maximus@thelemmy.club -4 points 1 week ago (1 children)

these guys gonna lose to china. already chinese coding models almost the same and 1/10 the price. check out z.ai and others

[–] tgxn@lemmy.tgxn.net 17 points 1 week ago* (last edited 1 week ago) (1 children)

This is unrelated to the post, China is using the same source material, they are just never going to be sued for using it 😁

[–] nutbutter@discuss.tchncs.de 15 points 1 week ago (2 children)

Tldr? What is shadow library scripts?

[–] starweasel@hexbear.net 12 points 1 week ago

scripts that NVIDIA distributed to clients so they could automatically download and preprocess The Pile dataset.

sounds like they allegedly wrote some stuff to get faster downloads/avoid throttling while they were allegedly pirating books from shadow libraries for their AI

[–] chahk@beehaw.org 10 points 1 week ago

In addition, the motion also targets the contributory copyright infringement allegations, which center on scripts and tools NVIDIA allegedly distributed so corporate customers could automatically download ‘The Pile,’ the dataset that contains Books3.