this post was submitted on 28 Apr 2026
20 points (100.0% liked)

Selfhosted

58818 readers
821 users here now

A place to share alternatives to popular online services that can be self-hosted without giving up privacy or locking you into a service you don't control.

Rules:

  1. Be civil: we're here to support and learn from one another. Insults won't be tolerated. Flame wars are frowned upon.

  2. No spam posting.

  3. Posts have to be centered around self-hosting. There are other communities for discussing hardware or home computing. If it's not obvious why your post topic revolves around selfhosting, please include details to make it clear.

  4. Don't duplicate the full text of your blog or github here. Just post the link for folks to click.

  5. Submission headline should match the article title (don’t cherry-pick information from the title to fit your agenda).

  6. No trolling.

  7. No low-effort posts. This is subjective and will largely be determined by the community member reports.

Resources:

Any issues on the community? Report it using the report flag.

Questions? DM the mods!

founded 2 years ago
MODERATORS
 

I've been thinking about adding this to my "Fuck it, I'll do it myself" / SHTF pile. I have a spare 10-15GB for a good selection of basic articles (across sciences, history, pop culture trivia etc).

https://get.kiwix.org/en/solutions/hotspots/content-bundles/

https://get.kiwix.org/en/solutions/hotspots/imager-service/

There's something inherently cool about having wikipedia in a box (yes, you'd likely need to refresh it once a year) but I've never heard of anyone actually self hosting a Kiwix instance.

top 12 comments
sorted by: hot top controversial new old
[–] surfrock66@lemmy.world 8 points 1 hour ago (1 children)

Yes, and I actually use it to train a local llm so I'm not hammering the internet. I have a ton of storage, and like to keep my kids in the sandbox, so we have wikipedia, project gutenberg, kahn academy, and a bunch of others all hosted behind an apache reverse proxy which is using mellon so there's LDAP auth.

[–] SuspciousCarrot78@lemmy.world 3 points 51 minutes ago (1 children)

That was actually my immediate thought. I already have Wikipedia as a trusted source for llm, but I would prefer to self host and not hammer them.

130GB to fit the entirely of Wikipedia is basically nothing and I'm mildly embarrassed not to have done it already.

[–] surfrock66@lemmy.world 1 points 43 minutes ago

I also try to participate in some of the farms, running zimit and mwoffliner to help make more archives. Feels like I'm helping.

[–] IcedRaktajino@startrek.website 9 points 1 hour ago* (last edited 1 hour ago) (1 children)

Yep, and I love it.

I've got a little Banana Pi M4 Zero (PiZero form factor but much more powerful and with 4 GB RAM) loaded up with, among other useful tools, Kiwix and the full Wikipedia dump. I just refreshed it with the 2026-02 full dump, so I'm caught up for the year. I've also got a lot of other offline docs loaded up (React, Bun, and the devdocs for several libraries I use) and it's nice to have local copies of those instead of googling every time.

Surprisingly, the full ~130 GB Wikipedia dump works fine on a regular Pi Zero 2 with 512 MB RAM. I don't know how ZIM works but it does work very very well.

[–] SuspciousCarrot78@lemmy.world 3 points 1 hour ago* (last edited 1 hour ago) (1 children)

I actually have a spare pi4 doing nothing, so was thinking of adding this to its jobs list.

130GB for the entire thing? And the pi doesn't choke on indexing / searching it?

On that: how capable is the search engine (I assume it has one?)

130GB for the entire thing? And the pi doesn't choke on indexing / searching it?

That was my thought. I knew it couldn't hold it in RAM but thought it would be doing crazy IO and limited by being on SD, but it seems to not be a problem. Like I said, I don't know how ZIM does it, but it does it well. Must have some kind of index that lets it fast travel to the correct blocks or something. I dunno lol.

how capable is the search engine (I assume it has one?)

Yep, it has search. It's....okay but kind of primitive. It's not slow, and if you're searching for something that's fairly unique (as far as keywords go), it does well. But if you're searching something like an acronym where it shows up as a regular word in other entries, it's a lot more hit or miss.

[–] comrade_twisty@feddit.org 5 points 1 hour ago* (last edited 1 hour ago) (2 children)

I do on my TrueNAS in a docker container. I have about 1TB of zim files hosted including pre-LLM copies of German, English and French Wikipedia as well as the last two current versions in these languages.

Aditionally I have project Gutenberg Books in german and english as well as lots of random technical, medical, survival, etc stuff that I came accross - a lot of that is trash though, but sorting is too time consuming and my NAS has 48TB so who cares...

[–] DishaweslemOride@lemmy.org 3 points 1 hour ago (1 children)

Humorously, you could use an agent to help you sort things. If theres anything it’s good at, it’s sorting.

How do you like TrueNAS? I’m too locked in to Synology at this point—with almost 800tb (in physical drives, less actual because of redundancy), and several devices.

[–] comrade_twisty@feddit.org 2 points 1 hour ago

Pretty happy woth TrueNAS, actually came from Synology and bought a UGreen DXP4800Plus, didn't like the UGOS on it and pretty much immediately switched tp TrueNAS. It's been absolutely flawless for about 15 months by now, docker integration in the OS is a bit limited by I run my compose stacks managed through dockge anyways.

I won't let LLMs crawl my data, it's mine and mine alone :)

[–] SuspciousCarrot78@lemmy.world 2 points 1 hour ago (1 children)

That's awesome. If I understand correctly, kiwix server creates a local site you can access from anything on your wLAN, as a transparent website? I take it it auto populates with your ZIM files, and that you can add to it (eg: project Gutenberg).

If so, that's a hell of a thing.

[–] comrade_twisty@feddit.org 3 points 1 hour ago

Yep exactly. Also you can have other people (friends/family) have access via VPN, Tailscale, etc.

[–] shadybraden@programming.dev 1 points 50 minutes ago