TTS-Audio-Suite and VibeVoice-ComfyUI

These are complements that serve different TTS engine preferences within the same ComfyUI workflow—TTS-Audio-Suite provides a multi-engine aggregator supporting RVC, Echo-TTS, Qwen3-TTS and others, while VibeVoice-ComfyUI specializes exclusively in Microsoft's VibeVoice model for users prioritizing that specific architecture's multi-speaker synthesis capabilities.

TTS-Audio-Suite

Established

VibeVoice-ComfyUI

Established

Maintenance 25/25

Adoption 10/25

Maturity 15/25

Community 18/25

Maintenance 10/25

Adoption 10/25

Maturity 15/25

Community 23/25

Stars: 774

Forks: 71

Downloads: —

Commits (30d): 79

Language: Python

License: —

Stars: 1,391

Forks: 219

Downloads: —

Commits (30d): 0

Language: Python

License: MIT

No Package No Dependents

About TTS-Audio-Suite

diodiogod/TTS-Audio-Suite

A ComfyUI custom node integration for multi-engine multi-language Text-to-Speech and Voice Conversion. Supports: RVC, Echo-TTS, Qwen3-TTS, Cozy Voice 3, Step Audio EditX, IndexTTS-2, Chatterbox (classic and multilingual 23-lang), F5-TTS, Higgs Audio 2 and VibeVoice with unlimited text length, SRT timing, Character support, and many audio tools

This suite helps video producers, content creators, and educators quickly turn written scripts into natural-sounding speech across many languages and voices. You input your text, choose from various AI voices, and the system generates audio, complete with precise timing for subtitles. It's designed for anyone needing professional-grade voiceovers or narrated content without hiring voice actors.

video-production content-creation localization e-learning audio-narration

About VibeVoice-ComfyUI

Enemyx-net/VibeVoice-ComfyUI

A comprehensive ComfyUI integration for Microsoft's VibeVoice text-to-speech model, enabling high-quality single and multi-speaker voice synthesis directly within your ComfyUI workflows.

This tool helps content creators and storytellers generate natural-sounding speech from text directly within their ComfyUI workflows. You provide written scripts, and it outputs high-quality audio, including options for single voices or dynamic multi-speaker conversations. It's designed for anyone needing realistic voiceovers, character dialogue, or narrated content.

content-creation voiceover storytelling audio-production digital-media

Related comparisons

TTS-Audio-Suite and ComfyUI-VibeVoice TTS-Audio-Suite and ComfyUI-VoxCPM TTS-Audio-Suite and ComfyUI-EdgeTTS TTS-Audio-Suite and ComfyUI-XTTS TTS-Audio-Suite and ComfyUI-Maya1_TTS TTS-Audio-Suite and ComfyUI-SparkTTS

Scores updated daily from GitHub, PyPI, and npm data. How scores work