this post was submitted on 18 Oct 2024
48 points (84.3% liked)
Fediverse
28490 readers
602 users here now
A community to talk about the Fediverse and all it's related services using ActivityPub (Mastodon, Lemmy, KBin, etc).
If you wanted to get help with moderating your own community then head over to !moderators@lemmy.world!
Rules
- Posts must be on topic.
- Be respectful of others.
- Cite the sources used for graphs and other statistics.
- Follow the general Lemmy.world rules.
Learn more at these websites: Join The Fediverse Wiki, Fediverse.info, Wikipedia Page, The Federation Info (Stats), FediDB (Stats), Sub Rehab (Reddit Migration), Search Lemmy
founded 2 years ago
MODERATORS
you are viewing a single comment's thread
view the rest of the comments
view the rest of the comments
(edit: I accidentally a word and didn't realize you wrote 'auto-report instead of deleting them'. Read the following with a grain of salt)
I've played (briefly) with automated moderation bots on forums, and the main thing stopping me from going much past known-bad profiles (e.g. visited the site from a literal spamlist) is not just false positives but malicious abuse. I wanted to add a feature which would censor an image immediately with a warning if it was reported for (say) porn, shock imagery or other extreme content, but if a user noticed this, they could falsely report content to censor it until a staff member dismisses the report.
Could an external brigade of trolls get legitimate users banned or their posts hidden just by gaming your bot? That's a serious issue which could make real users have their work deleted, and in my experience, users can take that very personally.
It's possible. I think it's more difficult than people think. You have to do it on a scale which is blatantly obvious to anyone who's looking, so you're just inviting a ban.
One person swore to me that it would be really easy, so I invited them to try, and they made a gang of bots which farmed karma and then mass-downvoted me, trying to get me banned from my own place. If you look at my profile you'll see some things which have -300 score because of it. I welcomed the effort, since I'm interested in how well it will resist that kind of attack. Their first effort did exactly nothing, because none of the downvote bots had any rank within the algorithm. I gave them some pointers on how they could improve for a second time around, and they went radio silent and I haven't heard from them since then.
Haha they thought it was too easy and were proven wrong!
Honestly, if a place is obscure enough, even smaller barriers of entry help, like forums that don't let you post on important boards until you build a reputation. There's only so much effort an adversary is willing to put in, and if there isn't a financial incentive or huge political incentive, that barrier could be low.