this post was submitted on 09 Jan 2025
1987 points (98.3% liked)
Technology
69098 readers
2881 users here now
This is a most excellent place for technology news and articles.
Our Rules
- Follow the lemmy.world rules.
- Only tech related news or articles.
- Be excellent to each other!
- Mod approved content bots can post up to 10 articles per day.
- Threads asking for personal tech support may be deleted.
- Politics threads may be removed.
- No memes allowed as posts, OK to post as comments.
- Only approved bots from the list below, this includes using AI responses and summaries. To ask if your bot can be added please contact a mod.
- Check for duplicates before posting, duplicates may be removed
- Accounts 7 days and younger will have their posts automatically removed.
Approved Bots
founded 2 years ago
MODERATORS
you are viewing a single comment's thread
view the rest of the comments
view the rest of the comments
Stack Overflow seems to be doing a prompt job of that already
You can download pretty much all of stackoverflow as ZIM files for self-hosting.
Of course they aren't small, but they are probably as small as it gets, since they are pretty efficiently compressed. I am not sure what you mean by
since it is really trivial to use them. Just load them with Kiwix and serve them as a website. It doesn't get much easier than that.
I looked into doing something similar with Wikipedia and the recommendation is also to use Kiwix, and the offline file size is also very large.
Welcome to the collapse! Hoarding "clean data" for personal use is like hoarding clean water and food: you need a place to keep it, and it starts going stale the minute you shelve it. So either buy a digital bunker to load up with what you need or ask the all knowing AI gods for answers like the other poors.
Also the Stack Exchange software used to be open source, surely there's still a fork somewhere. You could certainly run your own Developer QA site, but like with Lemmy, the problem then is getting enough traffic to be able to productively tap into the collective wisdom.
(Edit: sorry, this comes across mean spirited but I'm honestly sympathetic and just nihilisticallly frustrated to be in a similar situation. I foresee a big NAS and a lot of downloads in my future, but I hope we also find ways to share our forbidden knowledge until the day it can be free again)
When hosting this locally, I don't see how 200 GB is much of an issue. Storage is so cheap these days, if you want to host it locally, just buy a 256 GB SSD just for that data for $20. Anyway, you were asking for a mirror, to which I replied with the information about the ZIM files. I don't really understand the issue. Stackoverflow just isn't that small, there is not much you can do about that.
The download? Maybe, depends on your Internet connection's speed. Actually serving it as a website certainly doesn't take hours. It is rather a matter of seconds.
Neither is SO's content.