Hi people
Is there any good FOSS dictation and transcription software similar to Just Press Record for Android and/or Linux?
I couldn’t find anything so far. Thanks a lot.
Not FOSS, since it’s under a different license, but there’s “FUTO Voice Input” if you’re looking for a local alternative to Google’s voice dictation on Android:
https://gitlab.futo.org/alex/voiceinput
The repo has a list of supported and unsupported Android keyboards. Under the hood, it uses OpenAI Whisper.
I just tried this out. Apart from the fact that it doesn’t stream and only transcribes once you’ve finished speaking, it’s absolutely fantastic. Of course, accuracy would suffer if it were transcribing a stream, so I think that’s a plus rather than a minus, although some people might disagree.
Maybe not exactly what you’re looking for, but I found this a few weeks ago: https://github.com/k2-fsa/sherpa-onnx and I haven’t really seen anyone talk about it.
I’ve been using the TTS on Android for navigation and it’s way better than RHVoice and eSpeak.
I did try the STT on Android and it worked great, but I’ve never used STT before, so I don’t know how it compares to other STT engines.
I can vouch for whisper.cpp. It’s not 100% perfect, but it’s good enough to transcribe a half-hour podcast with numerous speakers, and the result needs only minimal fixing afterwards.
It falls into the “not sure how open source it is because AI is a mess, but it works” category…