this post was submitted on 08 May 2024
1717 points (99.3% liked)

[–] patatahooligan@lemmy.world 27 points 6 months ago (17 children)

This has nothing to do with centralization. AI companies are already scraping the web for everything useful. If you took the content from SO and split it into 1000 federated sites, it would still end up in an AI model. Decentralization would only help if we ever manage to hold the AI companies accountable for the en masse copyright violations they base their industry on.

[–] JackbyDev@programming.dev 2 points 6 months ago (2 children)

The irony is that folks complain about stuff like Discord partly because it cannot be scraped by search engines, but that would also protect it from being scraped by AI tools.

[–] the_toast_is_gone@lemmy.world 1 points 6 months ago (1 children)

Until Discord either starts selling data to OpenAI or they start scraping data from sites like https://spy.pet/ (or similar ones).

[–] JackbyDev@programming.dev 1 points 6 months ago

Believe me, I'm not saying Discord is the bastion of hope for data protection or anything like that lol.
