nazdridoy/kokoro-tts
A CLI text-to-speech tool using the Kokoro model, supporting multiple languages, voices (with blending), and various input formats including EPUB books and PDF documents.
This tool converts written text, including entire books and documents, into natural-sounding speech. You provide text files, EPUBs, or PDFs, and it generates audio files in various voices and languages, which can be streamed or saved as MP3s or WAVs. It's designed for anyone who needs to listen to long-form text, such as students, researchers, or audiobook creators.
1,296 stars. Available on PyPI.
Use this if you need to transform documents or books into audio, customize voices (even blending them), and handle content across multiple languages.
Not ideal if you prefer a graphical user interface (GUI) or need a solution that doesn't involve command-line usage.
Stars
1,296
Forks
142
Language
Python
License
MIT
Category
Last pushed
Dec 15, 2025
Commits (30d)
0
Dependencies
7
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/nazdridoy/kokoro-tts"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Compare
Related tools
thewh1teagle/kokoro-onnx
TTS with kokoro and onnx runtime
remsky/Kokoro-FastAPI
Dockerized FastAPI wrapper for Kokoro-82M text-to-speech model w/CPU ONNX and NVIDIA GPU PyTorch...
Lyrcaxis/KokoroSharp
Fast local TTS inference engine in C# with ONNX runtime. Multi-speaker, multi-platform and...
met4citizen/HeadTTS
HeadTTS: Free neural text-to-speech (Kokoro) with timestamps and visemes for lip-sync. Runs...
lucasjinreal/Kokoros
🔥🔥 Kokoro in Rust. https://huggingface.co/hexgrad/Kokoro-82M Insanely fast, realtime TTS with...