rsxdalv/TTS-WebUI

A single Gradio + React WebUI with extensions for ACE-Step, Kimi Audio, Piper TTS, GPT-SoVITS, CosyVoice, XTTSv2, DIA, Kokoro, OpenVoice, ParlerTTS, Stable Audio, MMS, StyleTTS2, MAGNet, AudioGen, MusicGen, Tortoise, RVC, Vocos, Demucs, SeamlessM4T, and Bark!

Score: 63 / 100 (Established)

This tool helps creators, content developers, and educators turn written text into natural-sounding speech or generate original audio. You provide text or audio input, and it produces spoken audio, music, or sound effects with a wide range of voice and style options. It suits anyone who needs custom voiceovers, audio content, or soundscapes without professional recording equipment.

3,017 stars. Actively maintained with 8 commits in the last 30 days.

Use this if you need a flexible way to generate speech from text, create custom audio, or convert voices using various advanced AI models.

Not ideal if you require only basic, pre-recorded voice clips or very simple text-to-speech without advanced customization or model switching.

content-creation voiceover-production e-learning-material audio-narration sound-design
No Package, No Dependents
Maintenance 17 / 25
Adoption 10 / 25
Maturity 16 / 25
Community 20 / 25


Stars: 3,017
Forks: 305
Language: TypeScript
License: MIT
Category: text-to-speech
Last pushed: Feb 19, 2026
Commits (30d): 8

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/rsxdalv/TTS-WebUI"

Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000 requests/day.
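
The same request can be made from a script. The following is a minimal Python sketch using only the standard library. It reuses the URL from the curl example; the function names, the parameter names (`domain`, `owner`, `repo`), and the assumption that the endpoint returns a JSON body are illustrative guesses, not documented behavior of the API.

```python
import json
import urllib.parse
import urllib.request

# Base path taken from the curl example above.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def build_quality_url(domain: str, owner: str, repo: str) -> str:
    """Build the request URL from three path segments.

    The segment names are hypothetical labels for the parts of the
    URL shown in the curl example (voice-ai / rsxdalv / TTS-WebUI).
    """
    parts = [urllib.parse.quote(p, safe="") for p in (domain, owner, repo)]
    return f"{BASE_URL}/{'/'.join(parts)}"


def fetch_quality(domain: str, owner: str, repo: str, timeout: float = 10.0):
    """Fetch the record for one repository.

    Assumption: the response body is JSON. If it is not,
    json.loads raises a ValueError.
    """
    url = build_quality_url(domain, owner, repo)
    with urllib.request.urlopen(url, timeout=timeout) as resp:
        return json.loads(resp.read().decode("utf-8"))
```

Calling `fetch_quality("voice-ai", "rsxdalv", "TTS-WebUI")` issues the same GET request as the curl command. With the keyless limit of 100 requests/day, it is worth caching the result rather than fetching on every run.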