rsxdalv/TTS-WebUI
A single Gradio + React WebUI with extensions for ACE-Step, Kimi Audio, Piper TTS, GPT-SoVITS, CosyVoice, XTTSv2, DIA, Kokoro, OpenVoice, ParlerTTS, Stable Audio, MMS, StyleTTS2, MAGNet, AudioGen, MusicGen, Tortoise, RVC, Vocos, Demucs, SeamlessM4T, and Bark!
This tool helps creators, content developers, and educators transform written text into natural-sounding speech or generate unique audio. You provide text or audio inputs and it produces high-quality spoken audio, music, or sound effects, allowing for a wide range of voice and stylistic options. It's ideal for anyone needing to create custom voiceovers, audio content, or soundscapes without needing professional recording equipment.
3,017 stars. Actively maintained with 8 commits in the last 30 days.
Use this if you need a flexible way to generate speech from text, create custom audio, or convert voices using various advanced AI models.
Not ideal if you require only basic, pre-recorded voice clips or very simple text-to-speech without advanced customization or model switching.
Stars
3,017
Forks
305
Language
TypeScript
License
MIT
Category
Last pushed
Feb 19, 2026
Commits (30d)
8
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/rsxdalv/TTS-WebUI"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Related tools
playht/pyht
PlayHT Python SDK - AI Text-to-Speech Streaming & Voice Cloning API
aedocw/epub2tts
Turn an epub or text file into an audiobook
DrewThomasson/VoxNovel
VoxNovel: generate audiobooks giving each character a different voice actor.
gianpaj/sexyvoice
Voice Cloning, Voice Call and Text to Speech platform. Perfect for content creators, developers,...
ohmstone/pocket-tts-deno
WASM ONNX build of Pocket TTS with voice cloning adapted from pocket-tts-server to run as a Deno...