Technology

84816 readers

3607 users here now

This is a most excellent place for technology news and articles.

Our Rules

Follow the lemmy.world rules.
Only tech related news or articles.
Be excellent to each other!
Mod approved content bots can post up to 10 articles per day.
Threads asking for personal tech support may be deleted.
Politics threads may be removed.
No memes allowed as posts, OK to post as comments.
Only approved bots from the list below, this includes using AI responses and summaries. To ask if your bot can be added please contact a mod.
Check for duplicates before posting, duplicates may be removed
Accounts 7 days and younger will have their posts automatically removed.

Approved Bots

founded 2 years ago

MODERATORS

L3s@lemmy.world

enu@lemmy.world

technopagan@lemmy.world

L4s@lemmy.world

L3s@hackingne.ws

1219

OpenAI declares AI race “over” if training on copyrighted works isn’t fair use (arstechnica.com)

submitted 1 year ago by cyrano@lemmy.dbzer0.com to c/technology@lemmy.world

488 comments fedilink hide all child comments

you are viewing a single comment's thread
view the rest of the comments

[–] FarceOfWill@infosec.pub 1 points 1 year ago (1 children)

I don't see how you can write the law such that it allows training ai on copyrighted data without making it possible to train a special llm on a single github instead of the entire universe, and essentially treat it as a full compression of the source.

[–] Grimy@lemmy.world 1 points 1 year ago

The outputs are still bound to copyright laws. Tracing pixel per pixel over an artwork doesn't make it immune to copyright laws, maliciously over training gen ai to act like a database and outright copy shouldn't either.

If you have a carbon copy of someone's github, it doesn't matter if you generated it, it's still a copy. Although code is a difficult example since I'm not entirely where the line is for one repo to be different then the other when they are accomplishing the same task.

I always imagined businesses just grabbed the gpl software and would tell their employees to rewrite it but different. Most things I dive down into seem to stem from one algorithm or two from a paper and the rest is fluff.