this post was submitted on 12 Jul 2024
564 points (98.5% liked)
Technology
59589 readers
2891 users here now
This is a most excellent place for technology news and articles.
Our Rules
- Follow the lemmy.world rules.
- Only tech related content.
- Be excellent to each another!
- Mod approved content bots can post up to 10 articles per day.
- Threads asking for personal tech support may be deleted.
- Politics threads may be removed.
- No memes allowed as posts, OK to post as comments.
- Only approved bots from the list below, to ask if your bot can be added please contact us.
- Check for duplicates before posting, duplicates may be removed
Approved Bots
founded 1 year ago
MODERATORS
you are viewing a single comment's thread
view the rest of the comments
view the rest of the comments
Why do you think that? The existing data sets won’t be going anywhere. Fine tuning doesn’t require nearly the same amount of training images and it’s not infeasible to get them from individual artists.
Not that that actually matters to open source developers, though, as the developer obligations only apply if you’re making the product available for a commercial purpose, so they’re not relevant to developers of gratis solutions - and most libre developers are also gratis developers. If your platform is not commercial and doesn’t have at least 25 Million monthly active users, you don’t need to allow users to add content provenance information in the first place. If it’s not for a commercial purpose, you aren’t prohibited from training on content containing content provenance information, or from removing it and training on it.
I'll be honest, I read it too fast and didn't see the "for commercial use part". I still think this is problematic because a lot of fine tuners and some companies putting out models either have a Patreon or offer their model for individual use but not to host on generating services without compensation (a good example of this is pony for fine tuners or codestal(I think) for general model providers). It also means any one building models can't then commercialize models on their end while still offering it for free to the community, it puts them in a tough position. I don't know how Metas llama could survive this or Google's gemma. I'm also curious how this affects huggingface since I'm not sure if they are making it available like it says in the bill by hosting it.
It does put the bill in a better light though and I will edit my comment.