this post was submitted on 21 Aug 2025
206 points (89.3% liked)

Selfhosted

50716 readers
530 users here now

A place to share alternatives to popular online services that can be self-hosted without giving up privacy or locking you into a service you don't control.

Rules:

  1. Be civil: we're here to support and learn from one another. Insults won't be tolerated. Flame wars are frowned upon.

  2. No spam posting.

  3. Posts have to be centered around self-hosting. There are other communities for discussing hardware or home computing. If it's not obvious why your post topic revolves around selfhosting, please include details to make it clear.

  4. Don't duplicate the full text of your blog or github here. Just post the link for folks to click.

  5. Submission headline should match the article title (don’t cherry-pick information from the title to fit your agenda).

  6. No trolling.

Resources:

Any issues on the community? Report it using the report flag.

Questions? DM the mods!

founded 2 years ago
MODERATORS
 

Some thoughts on how useful Anubis really is. Combined with comments I read elsewhere about scrapers starting to solve the challenges, I'm afraid Anubis will be outdated soon and we need something else.

you are viewing a single comment's thread
view the rest of the comments
[–] GuillaumeRossolini@infosec.exchange 4 points 1 day ago (2 children)

@mfed1122 @tofu any client-side tech to avoid (some of the) bots is bound to, as its popularity grows, be either circumvented by the bot’s developers or the model behind the bot will have picked up enough to solve it

I don’t see how any of these are going to do better than a short term patch

[–] mfed1122@discuss.tchncs.de 1 points 7 hours ago (1 children)

Yeah, you're absolutely right and I agree. So then do we have to resign the situation to being an eternal back-and-forth of just developing random new challenges every time the scrapers adapt to them? Like antibiotics for viruses? Maybe that is the way it is. And honestly that's what I suspect. But Anubis feels so clever and so close to something that would work. The concept of making it about a cost that adds up, so that it intrinsically only effects massive processes significantly, is really smart...since it's not about coming up with a challenge a computer can't complete, but just a challenge that makes it economically not worth it to complete. But it's disappointing to see that, at least with the current wait times, it doesn't seem like it will cost enough to dissuade scrapers. And worse, the cost is so low that it seems like making the cost significant to the scrapers will require really insufferable wait times for users.

[–] GuillaumeRossolini@infosec.exchange 1 points 39 minutes ago

@mfed1122 yeah that is my worry, what’s an acceptable wait time for users? A tenth of a second is usually not noticeable to a human, but is it useful in this context? What about half a second, etc

I don’t know that I want a web where everything is artificially slowed by a full second for each document

[–] rtxn@lemmy.world 9 points 1 day ago* (last edited 1 day ago) (1 children)

That's the great thing about Anubis: it's not client-side. Not entirely anyways. Similar to public key encryption schemes, it exploits the computational complexity of certain functions to solve the challenge. It can't just say "solved, let me through" because the client has to calculate a number, based on the parameters of the challenge, that fits certain mathematical criteria, and then present it to the server. That's the "proof of work" component.

A challenge could be something like "find the two prime factors of the semiprime 1522605027922533360535618378132637429718068114961380688657908494580122963258952897654000350692006139". This number is known as RSA-100, it was first factorized in 1991, which took several days of CPU time, but checking the result is trivial since it's just integer multiplication. A similar semiprime of 260 decimal digits still hasn't been factorized to this day. You can't get around mathematics, no matter how advanced your AI model is.