this post was submitted on 24 Feb 2026
321 points (95.5% liked)

Technology

81802 readers
4342 users here now

This is a most excellent place for technology news and articles.


Our Rules


  1. Follow the lemmy.world rules.
  2. Only tech related news or articles.
  3. Be excellent to each other!
  4. Mod approved content bots can post up to 10 articles per day.
  5. Threads asking for personal tech support may be deleted.
  6. Politics threads may be removed.
  7. No memes allowed as posts, OK to post as comments.
  8. Only approved bots from the list below, this includes using AI responses and summaries. To ask if your bot can be added please contact a mod.
  9. Check for duplicates before posting, duplicates may be removed
  10. Accounts 7 days and younger will have their posts automatically removed.

Approved Bots


founded 2 years ago
MODERATORS
 

you are viewing a single comment's thread
view the rest of the comments
[–] renzhexiangjiao@piefed.blahaj.zone 37 points 5 hours ago (3 children)

you can like... enforce this rule programatically? you don't have to say "pretty please" to ai? basically, when AI requests some potentially unwanted thing (like deleting an email), this request goes through a proxy that asks the human for confirmation. Also you can have a safe word set up in the chat interface to act as a killswitch. I thought these are ABCs of ai safety but apparently these are foreign concepts to this "safety director"

[–] underscores@lemmy.zip 4 points 1 hour ago* (last edited 31 minutes ago)

The people that design AI tools don't implement guardrails because then they'd have to admit AI is not ready for the shit they're trying to make

[–] zqps@sh.itjust.works 22 points 4 hours ago

The people who internalize this would never engage with a chatbot in this way in the first place. To them this is another intelligence they're conversing with, where you get what you want by following social decorum and enforcing your will amounts to abuse.

[–] HobbitFoot@thelemmy.club 18 points 4 hours ago

Program? Like a fucking farmer?