devnen/Kitten-TTS-Server

Self-host the ultra-lightweight Kitten TTS model with this enhanced API server with an intuitive Web UI, large text processing for audiobooks, and GPU acceleration.

/ 100

Emerging

This project helps anyone who needs to convert written text into natural-sounding spoken audio. You input text, select from various natural voices and models, and it outputs high-quality audio files. This is ideal for content creators, audiobook producers, developers building voice interfaces, or anyone needing to generate speech on their own computer or local network.

246 stars. No commits in the last 6 months.

Use this if you need to generate realistic speech from text using a high-performance system you can run on your own hardware, including compact devices like a Raspberry Pi, or with GPU acceleration.

Not ideal if you prefer a cloud-based service, require a very wide range of unique voices or languages beyond what's offered, or have extremely simple, infrequent text-to-speech needs.

audiobook-production content-creation voice-over local-AI-deployment home-automation

Stale 6m No Package No Dependents

Maintenance 2 / 25

Adoption 10 / 25

Maturity 15 / 25

Community 16 / 25

How are scores calculated?

Stars

246

Forks

Language

Python

License

MIT

Compare

Kitten-TTS-Server and KittenTTS

Higher-rated alternatives

snakers4/silero-models

Silero Models: pre-trained text-to-speech models made embarrassingly simple

abus-aikorea/voice-pro

Gradio WebUI for creators and developers, featuring key TTS (Edge-TTS, kokoro) and zero-shot...

JSchmie/ScrAIbe-WebUI

WebUI for ScAIbe

isaiahbjork/orpheus-tts-local

Run Orpheus 3B Locally With LM Studio

snakers4/silero-stress

Silero Stress — pre-trained enterprise-grade automated stress and homograph disambiguation for...

Explore Voice AI Tools

All categories Trending Voice AI directory Insights