this post was submitted on 04 Jun 2026
49 points (100.0% liked)

Selfhosted


I'm starting to develop arthritis in my fingers, which makes typing an interesting challenge.

I'm wondering, what is the best self-hosted solution for speech-to-text generally? Dragon Dictate used to be the thing, but is there anything open source and self-hostable that's superseded it?

I would love to have something I can speak into that can interact with pretty much any app, be that Notepad++, or my web browser when I'm entering stuff, or even when I'm creating this Lemmy post (which I actually made using FUTO Voice on my phone).

Windows and/or Linux ideally.

Any leads? Getting old sucks.

top 18 comments
[–] adarza@piefed.ca 1 points 9 hours ago (1 children)

the only guy i have doing dictation into a pc (and not into a special system like for medical records) regularly uses dragon on windows using a somewhat expensive microphone, a setup he's had for probably 30 years. he has one of those old-timey radio host voices, that might help some.. but i rarely ever see him having to edit what dragon spits out

[–] irmadlad@lemmy.world 1 points 7 hours ago

he has one of those old-timey radio host voices

Just popped into my head

NSFW!!!

https://www.youtube.com/watch?v=l_PkJ_4oEjc

[–] HelloRoot@lemy.lol 3 points 18 hours ago* (last edited 18 hours ago) (1 children)

I've recently read this super interesting and in depth blogpost about this topic:

https://nocoffei.com/?p=451

[–] rowinxavier@lemmy.world 6 points 22 hours ago (2 children)

A quick question, what accent do you have? As an Aussie I have real trouble speaking naturally with most speech to text software, open or closed source. I feel like the guys in that Scottish sketch show in the voice activated elevator. I sometimes use voice dictation for my notes for work and I spend almost as much time correcting as I do speaking.

That said, I have found a perfect solution. I can get well over the 95% correct mark by simply using an English or American accent. I can do both fairly well and the speech to text has no complaints. I imagine someone from Boston would have a tonne of trouble being understood, as would a Welsh person, but pretending to be a Californian or similar can help immensely.

I would love to find something that can be trained on my speech like Dragon NaturallySpeaking used to be. I used that in the early 2000s and at first it was awful, but training it for a few hours really did offer a noticeable improvement, and ongoing use continued to improve it further. My computer died and I lost all the trained data, so I never went back, but if I could I would definitely do that again.

[–] irmadlad@lemmy.world 2 points 15 hours ago

I feel like the guys in that Scottish sketch show in the voice activated elevator.

Just to say, that is a hilarious sketch. ELEVEN!

[–] SuspiciousCarrot78@aussie.zone 1 points 18 hours ago

Aussie too - same issue with having to fake accent sometimes :)

[–] theorangeninja@sopuli.xyz 3 points 19 hours ago (1 children)

I used Speed of Sound for a bit on desktop linux and after triggering the global shortcut it pastes the text where you have your cursor. You can use cloud models, local network hosted models or just download whisper/parakeet directly in the app. Worked great for me so far.

If you want to try something else on your phone you can try Outspoke.

[–] SuspiciousCarrot78@aussie.zone 2 points 18 hours ago
[–] wesker@lemmy.sdf.org 9 points 1 day ago (1 children)

I've started my own foray into push-to-talk and speech-to-text recently. I opted for faster-whisper since I wanted a lighter, local approach. It might be more DIY than what you're looking for, but I just wanted to chime in since it's fresh on my mind.
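For anyone curious what the DIY route looks like, here is a minimal sketch of the transcription half using faster-whisper's `WhisperModel`. The helper names (`join_segments`, `transcribe_file`), the `small` model size, and the CPU/int8 settings are assumptions for illustration, not the setup described above. Recording audio on a hotkey and typing the result into the focused window are left out.

```python
from typing import Iterable, Protocol


class HasText(Protocol):
    """Anything with a .text attribute, such as a faster-whisper segment."""

    text: str


def join_segments(segments: Iterable[HasText]) -> str:
    """Join transcribed segments into one line, dropping empty pieces."""
    parts = (seg.text.strip() for seg in segments)
    return " ".join(p for p in parts if p)


def transcribe_file(path: str, model_size: str = "small") -> str:
    """Transcribe an already-recorded audio file locally on the CPU.

    The import is deferred so join_segments stays usable without
    faster-whisper installed (pip install faster-whisper).
    """
    from faster_whisper import WhisperModel

    # Assumed settings: int8 on CPU keeps memory use low on modest hardware.
    model = WhisperModel(model_size, device="cpu", compute_type="int8")
    # transcribe() returns a lazy iterator of segments plus run metadata.
    segments, _info = model.transcribe(path)
    return join_segments(segments)
```

Calling `transcribe_file("recording.wav")` on a hypothetical recording would download the model on first use and return the dictated text as a single string. A push-to-talk tool would wrap this with a recorder bound to a key.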

[–] SuspiciousCarrot78@aussie.zone 1 points 18 hours ago

Thanks - that's worth a gander

[–] Mountainaire@lemmy.world 7 points 1 day ago (1 children)

So sorry about your finger pain. I don't know if this can be self-hosted, but https://github.com/jatinkrmalik/vocalinux has been pretty incredible.

[–] SuspiciousCarrot78@aussie.zone 1 points 18 hours ago
[–] jbloggs777@discuss.tchncs.de 3 points 22 hours ago (1 children)

I was recently reading about Talon Voice, which sounded quite interesting with lots of usability hacks. Unfortunately, not an option anymore if you are a Linux user: https://www.osnews.com/story/145162/accessibility-input-tool-removes-x11-support-doesnt-want-to-support-wayland-users-caught-in-the-middle/

[–] SuspiciousCarrot78@aussie.zone 1 points 18 hours ago
[–] probable_possum@leminal.space 4 points 1 day ago* (last edited 1 day ago) (1 children)

I can't contribute much. FUTO Keyboard is nice for Android. Speech Note is a tool for Linux that I use from time to time... It is a bit slow without GPU acceleration, and the resulting text is about 95% correct.

Have you read this comparison? https://openwhispr.com/blog/best-dictation-tools-linux-2026... It smells a bit like advertising for openwhispr but contains some pointers.

https://m.youtube.com/watch?v=5d_jGOGRSMc&t=16m55s. After talking for a while he mentions KDE Connect remote input, which could be used with FUTO Keyboard dictation, I guess?

[–] SuspiciousCarrot78@aussie.zone 1 points 18 hours ago

openwhispr

Second time today that has come up - I better go look. Cheers!