this post was submitted on 22 Feb 2024
207 points (97.7% liked)

Technology

59605 readers
3415 users here now

This is a most excellent place for technology news and articles.


Our Rules


  1. Follow the lemmy.world rules.
  2. Only tech related content.
  3. Be excellent to each another!
  4. Mod approved content bots can post up to 10 articles per day.
  5. Threads asking for personal tech support may be deleted.
  6. Politics threads may be removed.
  7. No memes allowed as posts, OK to post as comments.
  8. Only approved bots from the list below, to ask if your bot can be added please contact us.
  9. Check for duplicates before posting, duplicates may be removed

Approved Bots


founded 1 year ago
MODERATORS
 

Google Will Pay Reddit $60M a Year to Use Its Content for AI: Report::The move boosts revenue for Reddit ahead of its planned stock launch.

you are viewing a single comment's thread
view the rest of the comments
[–] bbkpr@lemmy.world 11 points 9 months ago (5 children)

Since half or more of reddit is now bots and shills, I don't imagine the training data is going to be great. That's fine, Gemini already sucks, so it'll be hard to make it worse.

[–] Dexx1s@lemmy.world 1 points 9 months ago (4 children)

The data being generated now sure, but there's still the years of actually useful data there.

Then add on the remaining half of comments that are from sensible users and it's a decent, and still fairly unique, dataset.

[–] bbkpr@lemmy.world 4 points 9 months ago* (last edited 9 months ago) (2 children)

There are many, many, many things posted as fact over the years on reddit that are not only untrue, but dangerous or even deadly in the case of some of the most idiotic advice given. I wish good luck telling them all apart to the poor 3rd world contractors the big commercial AI companies ~~exploit~~use to "train" their stochastic parrots.

[–] GluWu@lemm.ee 2 points 9 months ago* (last edited 9 months ago) (1 children)

That was one of my favorite shitposting formats. I would type a whole paragraph with technical details and real knowledge. Only the people who actually knew what I was talking about would realize its a shitpost.

[–] bbkpr@lemmy.world 3 points 9 months ago* (last edited 9 months ago)

Yep, and a lot of reddit is thinly veiled shitposts, bots, and uncredited karma whoring reposts of stolen content (the commercial AI companies should feel right at home here). Some of them are to anger the self righteous redditors who come to PC police anyone who dares speak against the far left zeitgeist. But most importantly, so, so many of them are just for the lols.

The scariest part is that those drawn out, apparently accurate but actual nonsense posts/comments, is how many of them end up near the top, with massive numbers of votes from those who think "well that sounds reasonable," but know nothing of the subject itself.

Semi-related: I really loved the shitposts where the guy would tell an elaborate story, and end it with his dad beating the shit out of him with jumper cables. Now that's quality reddit content.

[–] SpeakinTelnet@sh.itjust.works 1 points 9 months ago

Shoutout to the undertaker threwing mankind off hell in a cell and the lengthy false facts I learned from this user.

load more comments (1 replies)
load more comments (1 replies)