[–] Passerby6497@lemmy.world 3 points 3 days ago (1 children)

> when the reason for the offending output is that the user spent significant deliberate effort in coaxing the LLM to output what it did?

What about all the mentally unstable people who aren't trying to get it to say crazy things, but end up getting it to say crazy things just by the very nature of the conversations they're having with it? We're talking about a stochastic yes-man that can take any input and turn it into psychosis under the right circumstances, and we already have plenty of examples of it pushing unstable people over the edge.

The only reason this is "clickbait" is that someone chose to do this deliberately, rather than their own mental instability bringing it out organically. The fact that this can, and does, happen when someone is trying to make it happen should make you seriously consider the sort of things it will tell someone who is in a state where they legitimately consider crazy shit to be good advice.

[–] backgroundcow@lemmy.world -1 points 3 days ago* (last edited 3 days ago)

> The only reason this is "clickbait" is that someone chose to do this deliberately, rather than their own mental instability bringing it out organically.

This is my point. The case we are discussing now isn't noteworthy, because getting an LLM to say something disturbing deliberately is about as "impressive" as typing a disturbing sentence into MS Paint. One cannot create a useful "answer engine" without it being capable of producing something that looks weird, provocative, or offensive when taken out of context, any more than one can create a useful drawing program that blocks all offensive content. Nor would that be a worthwhile goal.

The cases to care about are those where the LLM takes a perfectly reasonable conversation off the rails. Clickbait like the one in the OP is actively harmful in that it drowns out such real cases, and is therefore deserving of ridicule.