this post was submitted on 01 Oct 2025
919 points (97.5% liked)

Technology

75704 readers
4505 users here now

This is a most excellent place for technology news and articles.


Our Rules


  1. Follow the lemmy.world rules.
  2. Only tech related news or articles.
  3. Be excellent to each other!
  4. Mod approved content bots can post up to 10 articles per day.
  5. Threads asking for personal tech support may be deleted.
  6. Politics threads may be removed.
  7. No memes allowed as posts, OK to post as comments.
  8. Only approved bots from the list below, this includes using AI responses and summaries. To ask if your bot can be added please contact a mod.
  9. Check for duplicates before posting, duplicates may be removed
  10. Accounts 7 days and younger will have their posts automatically removed.

Approved Bots


founded 2 years ago
MODERATORS
you are viewing a single comment's thread
view the rest of the comments
[–] M1ch431@slrpnk.net 2 points 1 day ago* (last edited 1 day ago) (1 children)

It's noise, a very large part of it. Reddit is financially motivated to make the data appear as if it is signal. It isn't - they have taken extremely minimal steps to ensure actual human participation.

This doesn't matter to AI companies, but it only warps that technology more and more. AI is a sinking ship with current methodologies. Reddit will die when the AI bubble bursts and those involved with Reddit already cashed out enough to be filthy rich.

[–] FlexibleToast@lemmy.world 1 points 1 day ago (1 children)

At this point we're just speculating. We don't have evidence either way of its mostly good or mostly bad data.

[–] M1ch431@slrpnk.net 0 points 1 day ago* (last edited 1 day ago) (1 children)

If you can land me a gig engaging with back end data from Reddit in a neutral capacity, it'd likely be pretty easy for a layman like me to confirm that it's largely noise. The AI companies buying data are getting scammed and you are free to remain neutral or plainly disagree with my assessment in the absence of concrete data that is publicly obtainable.

No company is immune to bots and inorganic engagement, least of all Reddit with the strategies employed.

[–] FlexibleToast@lemmy.world 1 points 1 day ago (1 children)

You keep trying to throw out all the data because some is bad. I don't think these companies would be paying that much for all bad data. You seem to really want to justify some bias you have. It's really weird.

[–] M1ch431@slrpnk.net 1 points 1 day ago* (last edited 1 day ago)

Reddit is presumably the only party with the ability to determine which data/interaction is truly legitimate or isn't.

I have a mostly neutral opinion on AI/LLMs, but I have a negative assessment of the companies driving it unsustainably. They are able to pay big money because AI is where money is flowing. When these actors crash the economy with their hubris, please write back to me - maybe I won't seem so weird at that point.