This is the most frustrating thing, so many people are arguing against their own interests with their efforts to "lock down" their content to prevent AIs from training on it. In this very thread I've been accused of being pro-giant-company when I'm quite the opposite. The harder we make it to train AI, the stronger the advantage that the existing giant companies have in this field.
FaceDeer
Thanks. Sometimes a randomly-chosen name from 13 years ago just takes on a life of its own over time. :)
AI trainers do a lot of work filtering and reformatting the training data. Often that's the most expensive part. There's a lot of synthetic data used these days too, reprocessed by other AIs.
You can't put conditions on it retroactively. You already published.
Negative examples are often just as useful for training an AI as positive ones. And it all depends on what you want to use the AI for. A moderator bot, for example, needs familiarity with the whole range of user responses it might see.
After all the hue and cry I have seen over stuff like Threads and Bluesky federation I don't imagine most people using the Fediverse have a particularly coherent philosophy on the matter.
That's up to the recipient to sort out as they need.
I'm not sure what you mean here. Nothing's being stolen. Even if you think there needs to be permission for training an AI off of data, Reddit has that permission.