this post was submitted on 20 Apr 2025
84 points (97.7% liked)

Selfhosted

46168 readers
448 users here now

A place to share alternatives to popular online services that can be self-hosted without giving up privacy or locking you into a service you don't control.

Rules:

  1. Be civil: we're here to support and learn from one another. Insults won't be tolerated. Flame wars are frowned upon.

  2. No spam posting.

  3. Posts have to be centered around self-hosting. There are other communities for discussing hardware or home computing. If it's not obvious why your post topic revolves around selfhosting, please include details to make it clear.

  4. Don't duplicate the full text of your blog or github here. Just post the link for folks to click.

  5. Submission headline should match the article title (don’t cherry-pick information from the title to fit your agenda).

  6. No trolling.

Resources:

Any issues on the community? Report it using the report flag.

Questions? DM the mods!

founded 2 years ago
MODERATORS
 

I have bunch of textbooks, and a lot of lecture notes and notes from colleagues, all in PDF format. What is a good way to classify, manage, store, and read these PDF files? I am trying calibre-web, but it seems difficult to find applications to connect to it.

top 30 comments
sorted by: hot top controversial new old
[–] starkzarn@infosec.pub 53 points 2 days ago (3 children)
[–] Feddinat0r@feddit.org 9 points 2 days ago

I second this. Using this for about half an year as my full document store, letters, anything.

Search is great, lovin it

[–] b3an@lemmy.world 3 points 2 days ago (2 children)

I third this! I saw title and came to say.

It’s actively being developed still, I get emails like once every 1–3 weeks, sometimes more. Sometimes less.

I use docker desktop for this. I also lowkey learned how to set up a multi-database for this at one point, but kinda stopped after I got it working. More to see if I could.

I also tried bare metal building this, but had shit luck. It’s been a couple years though. Docker just makes it easy as hell.

I still keep all the originals separate just in case, and the tool can help you make multiple copies too (like PDF-A). I’ve never needed to go back and use those though, as Paperless just works so well once you get the hang of it and how you want your data stored.

I picked a structure that kind of lets me find stuff easily even if the tool is not running (like just by folder structures).

I’ve yet to make this online available for obvious reasons. But it would be nice to be able to pull up pretty much any document you need, any time.

Any suggestions on safe web access quickly from a phone might be helpful (WireGuard maybe?) if you have them.

[–] walden@sub.wetshaving.social 5 points 2 days ago

For remote access, wireguard is great. You can access stuff via their internal addresses.

[–] nickiam2@aussie.zone 3 points 2 days ago

Tailscale is how I access my server. I've got a domain name that points to the internal tailscale IP address, but that's not really necessary

[–] Sunny@slrpnk.net 5 points 1 day ago

I believe this new project should hit your need quite well!

Papra is quite new in the selfhosted sphere but a welcome addition. Yet to test it myself but it sounds and looks very promising > https://github.com/papra-hq/papra

[–] m@social.tthi.as 15 points 2 days ago (1 children)

As a card-carrying librarian, I recommend using Zotero as a client with a WebDAV backend (I use Nextcloud).

If you’re studying or writing anything in which you need to cite your sources, Zotero is excellent and has integrations with many word processors. I’m pretty sure it can output your references as BibTeX if you’re in one of the disciplines that uses LaTeX.

[–] mobotsar@sh.itjust.works 11 points 2 days ago* (last edited 2 days ago)

I'm not even a librarian but pshh, I still got a card. They give them out to anyone, you know.

[–] db_geek@norden.social 15 points 2 days ago (1 children)
[–] meyotch@slrpnk.net 2 points 2 days ago

Seconded, for the second time!

Paperless is very easy to install and maintain. I use it for both scientific pdfs and random receipts. It’s easy to keep them separate

[–] philpo@feddit.org 6 points 2 days ago (1 children)

Contrary to the others here,while I love Paperless,using it for textbooks and notes only worked "somewhat" for me - it becomes quite clunky after a while.

Personally I would rather go with Calibre if I were you if you have more textbooks than notes. Even for notes, they can be attached as well and better organised than Paperless.

(And don't get me wrong paperless is awesome and I use it heavily)

[–] someacnt@sh.itjust.works 1 points 1 day ago

Thanks, I am trying both paperless and calkbre and see which works better for which tasks.

[–] stegosaur5491@lemm.ee 5 points 2 days ago

Maybe have a look at pdfding. It isn't to bloated and pretty simple, but has all the features that are essential for me. I like it.

[–] xgranade@lemmy.blahaj.zone 2 points 1 day ago

I'm a big fan of Docspell, there's lots of ways to import docs in (watching a folder, watching an e-mail account, etc), and it plays really well with my IdP instance over OIDC.

[–] weker01@sh.itjust.works 4 points 2 days ago (2 children)

What's wrong with just folders and file names?

[–] raltoid@lemmy.world 6 points 2 days ago* (last edited 2 days ago)

Benefit of data managers: Tags for easier searching and grouping, grouping that is folder agnostic, easily choosing a thumbnail, and in text focused ones you can usually search the content of the files from one location and easily look through the results for the correct one, etc.

[–] Appoxo@lemmy.dbzer0.com 2 points 1 day ago* (last edited 1 day ago) (1 children)

The same reason why immich fucking rocks.

Search pictures by "ocean" or "moon" and get shown related pictures.

If those doc managemnt solutions can also do OCR it's a doublw win.

[–] weker01@sh.itjust.works 1 points 1 day ago (1 children)

Hmmm. Maybe I should try that then. Never actually understood why people like these managers as I was always satisfied with the directory tree for organization.

Well maybe besides music. There beets fucking rocks. But in the end I use it also only to sort music into a directory structure.

[–] Appoxo@lemmy.dbzer0.com 1 points 1 day ago

I configured my immich to sort the photos by YYYY/MM/#original-photo-name#_assetnumber

Should I leave immich, I could still recover or view my files

[–] maxprime@lemmy.ml 4 points 2 days ago

All great recommendations here. But I’ve heard good things about PdfDing. I haven’t used it myself but have followed development since the developer is quite active.

[–] grimer@lemmy.world 3 points 2 days ago

Not sure what your preferred platform is but I’ve had great success connecting to my Calibre-web site with Yomu on iOS.

[–] N0x0n@lemmy.ml 6 points 2 days ago* (last edited 2 days ago) (1 children)

Not self-hosted so I doesn't really answer your question... However, if you're still a student consider the switch to Zotero.

Things you can self-host though, to make your books available everywhere, is some webdav sever to link your books directly to zotero and access them on every device.

If you're serious about book reading and study, nothing beats Zotero !

[–] Engywuck@lemm.ee 4 points 2 days ago

You can also actually not self-hosting the database but have your documents hosted on some WebDAV server you control.

[–] asceticism@lemmy.world 3 points 2 days ago

Not quite the correct application but linkwarden would work. Stores all your links but also backs everything up via html, plain text, and pdf. You can categorize content and tag content. Then there are filter and search tools.

You can just give it PDFs and it will import them over as well. Only saves them as a pdf but still would work.

I'm guessing this is not the best approach but wanted to give you options.

[–] sonalder@lemmy.ml 3 points 2 days ago (1 children)
[–] mcchots@sh.itjust.works 6 points 2 days ago (1 children)

StirlingPDF is great, but more of a PDF editor.

OP wants something to store and manage his PDF’s.

[–] sonalder@lemmy.ml 1 points 2 days ago

You're right I was certain it was doing both editing and managment but my memory played me

[–] thejevans@lemmy.ml 1 points 2 days ago

Not self hosted necessarily, but TagStudio is an interesting project worth keeping an eye on https://docs.tagstud.io/

[–] dominiquec@lemmy.world 0 points 2 days ago

I use Ubooquity.