this post was submitted on 13 Feb 2024
120 points (91.7% liked)
Fediverse
28490 readers
636 users here now
A community to talk about the Fediverse and all it's related services using ActivityPub (Mastodon, Lemmy, KBin, etc).
If you wanted to get help with moderating your own community then head over to !moderators@lemmy.world!
Rules
- Posts must be on topic.
- Be respectful of others.
- Cite the sources used for graphs and other statistics.
- Follow the general Lemmy.world rules.
Learn more at these websites: Join The Fediverse Wiki, Fediverse.info, Wikipedia Page, The Federation Info (Stats), FediDB (Stats), Sub Rehab (Reddit Migration), Search Lemmy
founded 2 years ago
MODERATORS
you are viewing a single comment's thread
view the rest of the comments
view the rest of the comments
Calling it "stealing content" is loaded terminology. You're posting content on an open protocol whose very purpose is to broadcast it far and wide.
I'm posting on a protocol whose purpose is to post content to other platforms that use the same protocol.
Your posts are awfully public, given that goal. I mean, anybody can freely access them and use programs to copy them for any use they desire.
Isn't this the same logic used to justify using our posts to train large language models?
No. There is a difference in context and intent.
Bottom line is, if people are concerned of having their conversation and content distributed out of their intended audiences, we'll all have to move to a fully encrypted network, where every message can only be decoded by the intended recipients. Getting upset because other people are not agreeing to your expectations of privacy is pointless.
I don't think it's unreasonable to expect that platforms you're not a member don't host content you create in order to make it look at though their platform is more popular and vibrant than it is, thus generating revenue of which you're not going to get a share of.
Assuming you hold rights to your content in the legal system you'd be claiming the damages in, you are of course free to file a lawsuit.
That raises a point, what is the process regarding DMCA and GDPR? There's no mechanism to delete posts via scraped content.
And neither via federation like in AP. It's a bit of a hole that should the technology get truly big will eventually come to a head.
We already had lawsuits in Germany related to linking to copyright infringing content, it's not a big stretch that if you scrape or federate a link that could infringe, you are in turn infringing.
But with AP, you can purge content. When you scrape content, there's no way to get a purge notice.
Where exactly does a "platform" end? Is it only lemmy.tf, or all Lemmy instances? Either way Mastodon or Peertube can hardly be considered to be the same platform as Lemmy. Activitypub is a protocol and definitely not a platform. Or would you consider threads.net part of your "platform" once it implements Activitypub?
That's a good question. We have the Lemmy platform, but that's built on the wider ActivityPub platform. So I would consider threads part of the platform.
Can we please get out of this tribal mindset? The thing about decentralized systems is that it lets everyone where they want to be without being forced into a walled garden. Why should I care about the platform that other people are using, if I can reach them just the same?
Who cares if Bluesky or Nostr become more popular than ActivityPub? As long as the "platform" is open source and not actively working to hold its users as hostages, we should praise and hope they get to grow as large as possible. We should be fighting against the big corporations, not the small independent developers. There are almost 3 billion people using Twitter/Facebook/Reddit/TikTok. They are the ones that we should be actively engaging and trying to win them over to our side.
And also by the bots we use to copy reddit content here, I presume? 😛
I'm actually not a fan of copying Reddit content over to here either. I'm at least consistent that in my thought processes.
Fair enough.
I don't even truly know where I stand if I had to personally decide it. I guess i'm one of those filthy Neutrals, I have no strong opinion either way. 😑
I'm of the opinion that we don't value our content enough. We should respect it and people. But we should also believe in our community to create its own content. I get so much help here and am so grateful for all the people. I love every experience I have here and I believe everyone can have similar experiences if they stop trying to recreate old experiences.
What's wrong with the logic used to justify using public data to train large language models?
I don't even like the fact that we're forced to train AI via captcha. When we start getting paid for what we put in, I'll reconsider my stance.