soldier444xd/KittenTTS
KittenTTS is an ultra-lightweight, CPU-friendly text-to-speech model with 15M params for real-time, high-quality voices. Open source, fast start. 😺
This project helps developers integrate high-quality, natural-sounding speech into their applications or devices. It takes plain text as input and generates clear, lifelike audio output that can run efficiently even on mobile phones or small computers. Developers creating apps, games, or embedded systems would find this useful for adding voice features.
Use this if you are a developer who needs to add real-time, high-quality text-to-speech capabilities to an application, especially for mobile, edge, or low-power devices.
Not ideal if you are looking for an out-of-the-box, end-user application for generating voiceovers without any development work.
Stars
24
Forks
1
Language
Python
License
Apache-2.0
Category
Last pushed
Mar 18, 2026
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/soldier444xd/KittenTTS"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Compare
Higher-rated alternatives
snakers4/silero-models
Silero Models: pre-trained text-to-speech models made embarrassingly simple
abus-aikorea/voice-pro
Gradio WebUI for creators and developers, featuring key TTS (Edge-TTS, kokoro) and zero-shot...
JSchmie/ScrAIbe-WebUI
WebUI for ScAIbe
isaiahbjork/orpheus-tts-local
Run Orpheus 3B Locally With LM Studio
snakers4/silero-stress
Silero Stress — pre-trained enterprise-grade automated stress and homograph disambiguation for...