this post was submitted on 18 Nov 2024
333 points (94.2% liked)
Technology
59495 readers
3114 users here now
This is a most excellent place for technology news and articles.
Our Rules
- Follow the lemmy.world rules.
- Only tech related content.
- Be excellent to each another!
- Mod approved content bots can post up to 10 articles per day.
- Threads asking for personal tech support may be deleted.
- Politics threads may be removed.
- No memes allowed as posts, OK to post as comments.
- Only approved bots from the list below, to ask if your bot can be added please contact us.
- Check for duplicates before posting, duplicates may be removed
Approved Bots
founded 1 year ago
MODERATORS
you are viewing a single comment's thread
view the rest of the comments
view the rest of the comments
I suspect it may be due to a similar habit I have when chatting with a corporate AI. I will intentionally salt my inputs with random profanity or non sequitur info, for lulz partly, but also to poison those pieces of shits training data.
I don't think they add user input to their training data like that.
They don't. The models are trained on sanitized data, and don't permanently "learn". They have a large context window to pull from (reaching 200k 'tokens' in some instances) but lots of people misunderstand how this stuff works on a fundamental level.
True, but I'm still gonna swear at it until I get to talk to a human