this post was submitted on 03 Dec 2025
224 points (97.9% liked)

Selfhosted

55596 readers
338 users here now

A place to share alternatives to popular online services that can be self-hosted without giving up privacy or locking you into a service you don't control.

Rules:

  1. Be civil: we're here to support and learn from one another. Insults won't be tolerated. Flame wars are frowned upon.

  2. No spam posting.

  3. Posts have to be centered around self-hosting. There are other communities for discussing hardware or home computing. If it's not obvious why your post topic revolves around selfhosting, please include details to make it clear.

  4. Don't duplicate the full text of your blog or github here. Just post the link for folks to click.

  5. Submission headline should match the article title (don’t cherry-pick information from the title to fit your agenda).

  6. No trolling.

  7. No low-effort posts. This is subjective and will largely be determined by the community member reports.

Resources:

Any issues on the community? Report it using the report flag.

Questions? DM the mods!

founded 2 years ago
MODERATORS
 

… by running your own instance of the free and open-source federated metasearch engine SearXNG on OpenBSD!

top 50 comments
sorted by: hot top controversial new old
[–] yodeljunkmanenvy@piefed.social 30 points 2 months ago (2 children)

There is also a list of publicly operated SearXNG instances at https://searx.space/. We host the one at https://search.freestater.org/, and there are plenty of other good ones.
bVuZT3n0X1rOSXk.jpg

[–] Appoxo@lemmy.dbzer0.com 3 points 2 months ago (3 children)

Who says they are securely operated and don't store any data??

[–] jackr@lemmy.dbzer0.com 7 points 2 months ago (1 children)

who says that about any search engine? can you trust them? searXNG is usually run by random people who are less likely to use your data than a larger search company

[–] Appoxo@lemmy.dbzer0.com -4 points 2 months ago

At least with Google it wont be an individual that can open my data (most likely).
Probably only the AI or whatever can actually read any of it directly.

[–] yodeljunkmanenvy@piefed.social 3 points 1 month ago* (last edited 1 month ago) (1 children)

The devs at SearXNG have a bot that regularly scans the public instances for changes to the source code and delists them as a public instance if it's altered.

If you go to https://searx.space/, they show the results of the scans for each instance.

The software is free and open source. You are encouraged to inspect the code yourself to make sure no data is collected!

Here is the source code: 

https://github.com/searxng/searxng

[–] Appoxo@lemmy.dbzer0.com 1 points 1 month ago

Thank you very much for proving what's correct :)

load more comments (1 replies)
[–] thejml@sh.itjust.works 24 points 2 months ago

I run it in a container on Kubernetes. Definitely recommend.

[–] jol@discuss.tchncs.de 18 points 2 months ago (2 children)

I still don't understand how Searx is able to operate for free. Don't the API calls cost money?

[–] irmadlad@lemmy.world 17 points 2 months ago

From what I've read, I believe it's a combination of donations. sponsors, volunteer hosting from like minded organizations.

[–] Neptr@lemmy.blahaj.zone 9 points 2 months ago (1 children)

If I had to guess, they probably don't use the APIs, inside using scrapping of some sort.

[–] sunstoned@lemmus.org 1 points 2 months ago

My understanding is it scrapes what it can't meaningfully get out of an API. Public instances run into rate limiting, but private instances don't really have that problem.

[–] ap1werks@sh.itjust.works 16 points 2 months ago (2 children)

look at that domain name! respect

[–] Cyberflunk@lemmy.world 4 points 2 months ago

If you're interested in this, the term you're looking for is punycode

[–] quick_snail@feddit.nl 4 points 2 months ago* (last edited 2 months ago)

Usually its a sign of a scam. Looks like a ransomware c2 domain

[–] carrylex@lemmy.world 12 points 2 months ago* (last edited 2 months ago) (1 children)

For anyone wondering xn--gckvb8fzb.com is マリウス.com

Also: Maybe they should get a "normal" domain when you post english articles...

[–] mesamunefire@piefed.social 1 points 2 months ago* (last edited 2 months ago) (1 children)

Does the url not resolve on certain browsers or something? I usually just copy/paste or use a firefox plugin to generate posts for lemmy/piefed/fediverse.

[–] carrylex@lemmy.world 6 points 2 months ago* (last edited 2 months ago) (1 children)

It resolves well but punnycode is disabled in some browsers or security enhanced profiles, so that you can easier detect punnycode domains that try to fake other domains.

[–] mesamunefire@piefed.social 1 points 2 months ago (1 children)

Im not familiar with punnycode. Interesting.

[–] sludgewife@lemmy.blahaj.zone 1 points 2 months ago

it's punycode

[–] grue@lemmy.world 7 points 2 months ago

Is metasearch really the best we can do? What about YaCy, or something else more like that?

[–] pineapple@lemmy.ml 7 points 2 months ago (1 children)

The search engines that searxng interact with still track you. For this reason I will always use a public instance to mix up the tracking with everyone else using it.

[–] TractorDuffy@lemmy.world 0 points 2 months ago (1 children)
[–] pineapple@lemmy.ml 4 points 2 months ago (1 children)

Using a public instance is more private than using a private selfhosted instance.

[–] TractorDuffy@lemmy.world -1 points 2 months ago (2 children)

..a public Lemmy instance? How is that related to the the search engine?

[–] 044en17rl@lemmy.world 2 points 1 month ago

a public SearXNG instance

[–] pineapple@lemmy.ml 1 points 1 month ago

044en17rl is right I meant a public seaxng instance.

[–] biotin7@sopuli.xyz 7 points 2 months ago* (last edited 2 months ago) (3 children)

Or how about YaCy. It's self-hostable & you can have your own web index and start your own web-crawler.

It's peer-to-peer too

[–] motruck@lemmy.zip 3 points 2 months ago (1 children)

This is exactly what I came here to find. Thank you for posting it. If I can be so bold selfhosters should really be leaning this way searxng is great but it still uses big tech.

The other thing we need is a way to identify good crawling agents or *smol agents over corporate bots that just steal content.

If selfhosters can unite and build a good index perhaps searching can go back to the way it was vs a vector to sell you more and collect your data.

[–] biotin7@sopuli.xyz 1 points 2 months ago

You know what else I like ? Fossil

It's a FOSS alternative to Git itself, their official site itself is a self-hosted instance of Fossil.

[–] mesamunefire@piefed.social 2 points 2 months ago

I personally love yacy.

[–] 044en17rl@lemmy.world 2 points 1 month ago (1 children)

How good are the results compared to Google/Duckduckgo?

[–] biotin7@sopuli.xyz 1 points 1 month ago

You make your own index with this one

[–] MentalEdge@sopuli.xyz 5 points 2 months ago* (last edited 2 months ago)

It's not federated tho?

What do they mean when they call it that?

[–] N0x0n@lemmy.ml 5 points 2 months ago* (last edited 2 months ago) (3 children)

I used to self-host searxng for a while, but somehow the search results were always off and mixed with to much non-relevant results :/.

It's not about searxng itself... Rather how the most relevant info gets drown into AI slope and non-sense bullshit. The best blogposts/info are transmitted from people to people...

I'm kinda sad to admit that stupid AI "solved" this issue and had better results :/

[–] A_Random_Idiot@lemmy.world 3 points 2 months ago* (last edited 2 months ago) (1 children)

I used to self-host searxng for a while, but somehow the search results were always off and mixed with to much non-relevant results :/.

I mean, getting non-relevant results happens with every search engine anymore.

The days of your search results being relevant, and what you want on the first page, are long dead thanks to SEO and other factors.

[–] N0x0n@lemmy.ml 2 points 2 months ago (1 children)

Yeah you're right ! However, ages ago, I still remember how you could go to page 20+ and still find some really interesting things !

Here, past page 2 it's just some random shit...

[–] A_Random_Idiot@lemmy.world 2 points 2 months ago (1 children)

Most of its just AI generated websites. Search for any topic and you're likely to get 50 AI generated website that give a similar bulletpoint presentation of what search item X is according to the AI that generated the site.

[–] N0x0n@lemmy.ml 1 points 2 months ago

50 AI generated website that give a similar bulletpoint presentation of what search item X is according to the AI that generated the site.

Yeah that's what I meant by "random" shit ! 😁

[–] sunstoned@lemmus.org 2 points 2 months ago (2 children)

You can self host that too ;)

OpenWebUI + Ollama + SearxNG. OpenWebUI can do llm web search using the engine of your choice (even self hosted SearxNG!). From there it's easy to set the default prompt to always give you the top (10, 20, whatever) raw results so you're not confined to ai results. It's not quite duck.ai slick but I think I can get there with some more tinkering.

[–] dubyakay@lemmy.ca 3 points 2 months ago (1 children)

Is there a guide on how to do this on Linux + 16GB Radeon?

[–] sunstoned@lemmus.org 6 points 2 months ago* (last edited 2 months ago)

I mean, I could write one! I kind of just pieced it together from guides on the three individuals

Edit: back of the napkin guide below is basically in the OpenWebUI docs already! I use NixOS (btw) but docker/podman should work well.

OpenWebUI + Ollama setup -- tl;dr docker run -d -p 3000:8080 --add-host=host.docker.internal:host-gateway -v open-webui:/app/backend/data --name open-webui --restart always ghcr.io/open-webui/open-webui:main

OpenWebUI SearXNG guide -- a little more involved, but not difficult.

[–] N0x0n@lemmy.ml 1 points 2 months ago

Ohoho? That's interesting. I don't have the horse power to selfhost an AI, but that's good to know !

Thanks for the pointer !!!

[–] Fleppensteijn@feddit.nl 1 points 2 months ago

I don't know any other search that lets you block urls from results. Blocking stuff like social media and Amazon cleans up the results very well.

[–] ArchEngel@lemmy.ca 4 points 2 months ago (1 children)

Thanks for posting, both a great reminder to try setting this up on my unraid, and also to add the RSS feed of that site to Feeder.

[–] pipes@sh.itjust.works 3 points 2 months ago

I just added it too, I had read a few articles of them already

[–] Appoxo@lemmy.dbzer0.com 4 points 2 months ago (2 children)

Brave is a search engine?
That's news to me.

[–] elcroqueta@lemmy.cafe 3 points 2 months ago

Brave have they’re own search engine

[–] Cyberflunk@lemmy.world 1 points 2 months ago

it's no kagi, but its ok

[–] ripcord@lemmy.world 3 points 2 months ago