remsky/Kokoro-FastAPI
Dockerized FastAPI wrapper for Kokoro-82M text-to-speech model w/CPU ONNX and NVIDIA GPU PyTorch support, handling, and auto-stitching
This project helps content creators, educators, and developers quickly turn written text into natural-sounding speech in multiple languages, such as English, Japanese, and Chinese. You provide text and select voices, and it outputs high-quality audio files; voices can also be blended into custom mixes. It's designed for individuals or teams needing on-demand, customizable text-to-speech capabilities.
Use this if you need to generate spoken audio from text for videos, e-learning modules, virtual assistants, or any application requiring custom voice output without external cloud services.
Not ideal if you require real-time, ultra-low-latency speech generation for live conversations, or if you need to clone a specific voice from audio samples.
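The workflow described above (text in, voices selected, audio file out) can be sketched as a small HTTP client. This is a minimal sketch under stated assumptions, not the project's documented client: it assumes a locally running container on port 8880 that exposes an OpenAI-style `/v1/audio/speech` route, a model name of `kokoro`, and voice mixes written by joining voice names with `+`. The voice names `af_bella` and `af_sky` and the helper functions `build_speech_request` and `synthesize` are illustrative; check the repository's README for the values that apply to your deployment.

```python
import json
import urllib.request

# Assumed default: a local container listening on port 8880 with an
# OpenAI-style speech route. Adjust to match your deployment.
BASE_URL = "http://localhost:8880/v1"


def build_speech_request(text, voices, response_format="mp3", base_url=BASE_URL):
    """Build (without sending) a POST request for the assumed speech endpoint.

    `voices` is a list of voice names; more than one name is joined with "+"
    to request a blended voice (assumed mix syntax).
    """
    if not text.strip():
        raise ValueError("text must not be empty")
    if not voices:
        raise ValueError("at least one voice is required")
    payload = {
        "model": "kokoro",  # assumed model identifier
        "input": text,
        "voice": "+".join(voices),
        "response_format": response_format,
    }
    return urllib.request.Request(
        url=f"{base_url}/audio/speech",
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def synthesize(text, voices, out_path="output.mp3"):
    """Send the request and write the returned audio bytes to `out_path`."""
    req = build_speech_request(text, voices)
    with urllib.request.urlopen(req) as resp, open(out_path, "wb") as f:
        f.write(resp.read())
    return out_path
```

With a server running, `synthesize("Hello world!", ["af_bella", "af_sky"])` would write the audio to `output.mp3`. Building the request separately from sending it keeps the payload easy to inspect without a live server.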
Stars: 4,585
Forks: 764
Language: Python
License: Apache-2.0
Category
Last pushed: Jan 04, 2026
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/remsky/Kokoro-FastAPI"
Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000/day.
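The same lookup can be made from Python. This is a minimal sketch: the path layout (category, then owner, then repository) is inferred from the single URL shown above, and the assumption that the endpoint returns JSON is not confirmed by this page. The helper names `quality_url` and `fetch_quality` are hypothetical.

```python
import json
import urllib.request
from urllib.parse import quote

API_ROOT = "https://pt-edge.onrender.com/api/v1/quality"


def quality_url(category, owner, repo):
    """Build the lookup URL, assuming a category/owner/repo path layout."""
    parts = [quote(part, safe="") for part in (category, owner, repo)]
    return API_ROOT + "/" + "/".join(parts)


def fetch_quality(category, owner, repo, timeout=10):
    """Fetch the record for one repository.

    Assumes the response body is JSON; the schema is not documented here,
    so the parsed object is returned as-is.
    """
    url = quality_url(category, owner, repo)
    with urllib.request.urlopen(url, timeout=timeout) as resp:
        return json.loads(resp.read().decode("utf-8"))
```

Calling `fetch_quality("voice-ai", "remsky", "Kokoro-FastAPI")` issues the same request as the curl command. Unauthenticated calls count against the 100 requests/day limit.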
Related tools
thewh1teagle/kokoro-onnx
TTS with kokoro and onnx runtime
nazdridoy/kokoro-tts
A CLI text-to-speech tool using the Kokoro model, supporting multiple languages, voices (with...
Lyrcaxis/KokoroSharp
Fast local TTS inference engine in C# with ONNX runtime. Multi-speaker, multi-platform and...
met4citizen/HeadTTS
HeadTTS: Free neural text-to-speech (Kokoro) with timestamps and visemes for lip-sync. Runs...
lucasjinreal/Kokoros
🔥🔥 Kokoro in Rust. https://huggingface.co/hexgrad/Kokoro-82M Insanely fast, realtime TTS with...