nazdridoy/kokoro-tts

A CLI text-to-speech tool using the Kokoro model, supporting multiple languages, voices (with blending), and various input formats including EPUB books and PDF documents.

61
/ 100
Established

This tool converts written text, including entire books and documents, into natural-sounding speech. You provide text files, EPUBs, or PDFs, and it generates audio files in various voices and languages, which can be streamed or saved as MP3s or WAVs. It's designed for anyone who needs to listen to long-form text, such as students, researchers, or audiobook creators.

1,296 stars. Available on PyPI.

Use this if you need to transform documents or books into audio, customize voices (even blending them), and handle content across multiple languages.

Not ideal if you prefer a graphical user interface (GUI) or need a solution that doesn't involve command-line usage.

audiobooks document-accessibility language-learning content-creation reading-aloud
Maintenance 6 / 25
Adoption 10 / 25
Maturity 25 / 25
Community 20 / 25

How are scores calculated?

Stars

1,296

Forks

142

Language

Python

License

MIT

Last pushed

Dec 15, 2025

Commits (30d)

0

Dependencies

7

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/nazdridoy/kokoro-tts"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.