this post was submitted on 04 Sep 2024
140 points (84.3% liked)
Fediverse
28444 readers
664 users here now
A community to talk about the Fediverse and all it's related services using ActivityPub (Mastodon, Lemmy, KBin, etc).
If you wanted to get help with moderating your own community then head over to !moderators@lemmy.world!
Rules
- Posts must be on topic.
- Be respectful of others.
- Cite the sources used for graphs and other statistics.
- Follow the general Lemmy.world rules.
Learn more at these websites: Join The Fediverse Wiki, Fediverse.info, Wikipedia Page, The Federation Info (Stats), FediDB (Stats), Sub Rehab (Reddit Migration), Search Lemmy
founded 1 year ago
MODERATORS
you are viewing a single comment's thread
view the rest of the comments
view the rest of the comments
@dch82 first, "normies" have to not get harassed when they come here.
Unfortunately the biggest Fedi software refuses to add automated reporting of offensive posts so if it's not reported, the admins won't even see it.
People coming from corporate social media are used to ignoring the report button because in their experience, it either doesn't work, or gets ignored by admins anyway.
We need automated reporting.
@fediverse
I'm fine with auto REPORTING, but the actual moderation needs to be a human. Auto moderation is bad. It gets things wrong. It's how I got banned from both twitter (calm down, this was back in 2018 before it was an elon owned nazi cesspool), and reddit.
On twitter I saw a funny video that was posted, and I replied "Aw man, that killed me".
I was banned for "inciting death threats"
@Lost_My_Mind yeah, just reporting.
I want to do the actual judgement, but if I don't know the post exists, I can't judge anything and it makes me so mad that possible racist stuff can exist on my instance without my knowledge because I havent "seen" it.
@fediverse
That's the thing about automation and training models.
First, they implement some sort of auto-reporting bot that requires a human to review them. In the beginning, it only about 50% accurate, but as they give it more and more examples of good and bad results through the human reviews, it moves to 80%, then 90%, then 99%, then 99.99% accuracy.
After a while, the humans on the other end are so numb to the 9999 entries they have to mark as approved that they can barely tell what's a rejection themselves, and the moderation team is asking itself just what this human review is actually doing. If it's 99.99% accurate, why not let the bot decide?
Then, the model moves on from auto-reporting to auto-moderation.