diodiogod/TTS-Audio-Suite
A ComfyUI custom node integration for multi-engine multi-language Text-to-Speech and Voice Conversion. Supports: RVC, Echo-TTS, Qwen3-TTS, Cozy Voice 3, Step Audio EditX, IndexTTS-2, Chatterbox (classic and multilingual 23-lang), F5-TTS, Higgs Audio 2 and VibeVoice with unlimited text length, SRT timing, Character support, and many audio tools
This suite helps video producers, content creators, and educators quickly turn written scripts into natural-sounding speech across many languages and voices. You input your text, choose from various AI voices, and the system generates audio, complete with precise timing for subtitles. It's designed for anyone needing professional-grade voiceovers or narrated content without hiring voice actors.
774 stars. Actively maintained with 79 commits in the last 30 days.
Use this if you need to create diverse voiceovers, localized audio content, or synchronize spoken text with video, especially when working with multiple languages or character voices.
Not ideal if you prefer simple, single-voice text-to-speech without advanced editing, voice conversion, or subtitle timing features.
Stars
774
Forks
71
Language
Python
License
—
Category
Last pushed
Mar 18, 2026
Commits (30d)
79
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/diodiogod/TTS-Audio-Suite"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Compare
Related tools
Enemyx-net/VibeVoice-ComfyUI
A comprehensive ComfyUI integration for Microsoft's VibeVoice text-to-speech model, enabling...
wildminder/ComfyUI-VibeVoice
ComfyUI custom node for the VibeVoice TTS. Expressive, long-form, multi-speaker conversational audio
wildminder/ComfyUI-VoxCPM
ComfyUI node for highly expressive speech and realistic zero-shot voice cloning
1038lab/ComfyUI-EdgeTTS
ComfyUI-EdgeTTS is a powerful text-to-speech node for ComfyUI, leveraging Microsoft's Edge TTS...
eigenpunk/ComfyUI-audio
some generative audio tools for ComfyUI