wildminder/ComfyUI-VibeVoice
ComfyUI custom node for the VibeVoice TTS. Expressive, long-form, multi-speaker conversational audio
This tool helps content creators, podcasters, and educators generate natural-sounding, multi-speaker audio conversations from a written script. You provide a text dialogue and optionally some reference audio clips for specific voices, and it produces a single audio file with up to four distinct, expressive speakers. It's designed for anyone who needs high-quality, long-form conversational audio without recording multiple people.
563 stars. No commits in the last 6 months.
Use this if you need to create realistic spoken dialogue, such as for podcasts, audiobooks, or explainer videos, and want to control multiple distinct voices from a text script.
Not ideal if you only need single-speaker narration or extremely short audio snippets, or if you prefer to record voices manually.
Stars
563
Forks
105
Language
Python
License
MIT
Category
Last pushed
Sep 25, 2025
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/wildminder/ComfyUI-VibeVoice"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Compare
Related tools
diodiogod/TTS-Audio-Suite
A ComfyUI custom node integration for multi-engine multi-language Text-to-Speech and Voice...
Enemyx-net/VibeVoice-ComfyUI
A comprehensive ComfyUI integration for Microsoft's VibeVoice text-to-speech model, enabling...
wildminder/ComfyUI-VoxCPM
ComfyUI node for highly expressive speech and realistic zero-shot voice cloning
1038lab/ComfyUI-EdgeTTS
ComfyUI-EdgeTTS is a powerful text-to-speech node for ComfyUI, leveraging Microsoft's Edge TTS...
eigenpunk/ComfyUI-audio
some generative audio tools for ComfyUI