This is an automated archive made by the Lemmit Bot.
The original was posted on /r/selfhosted by /u/PinGUY on 2025-06-13 13:24:17+00:00.
Wanted to share something I’ve been working on: a Firefox add-on that does neural-quality text-to-speech entirely offline using a locally hosted model.
No cloud. No API keys. No telemetry. Just you and a ~82M parameter model running in a tiny Flask server.
It uses the Kokoro TTS model and supports multiple voices. Works on Linux, macOS, and Windows but not tested
Tested on a 2013 Xeon E3-1265L and it still handled multiple jobs at once with barely any lag.
Requires Python 3.8+, pip, and a one-time model download. There’s a .bat startup option for Windows users (un tested), and a simple script. Full setup guide is on GitHub.
GitHub repo: https://github.com/pinguy/kokoro-tts-addon
Would love some feedback on this please.
Hear what one of the voice examples sound like: https://www.youtube.com/watch?v=XKCsIzzzJLQ