Enemyx-net/VibeVoice-ComfyUI
A comprehensive ComfyUI integration for Microsoft's VibeVoice text-to-speech model, enabling high-quality single and multi-speaker voice synthesis directly within your ComfyUI workflows.
This tool helps content creators and storytellers generate natural-sounding speech from text directly within their ComfyUI workflows. You provide written scripts, and it outputs high-quality audio, including options for single voices or dynamic multi-speaker conversations. It's designed for anyone needing realistic voiceovers, character dialogue, or narrated content.
1,391 stars.
Use this if you need to create realistic spoken audio from text for videos, podcasts, or interactive experiences, and want fine-grained control over voices and speech characteristics.
Not ideal if you primarily need to transcribe existing audio to text, or if you require extremely lightweight, simple text-to-speech without advanced features like voice cloning or multi-speaker support.
Stars
1,391
Forks
219
Language
Python
License
MIT
Category
Last pushed
Feb 18, 2026
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/Enemyx-net/VibeVoice-ComfyUI"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Compare
Related tools
diodiogod/TTS-Audio-Suite
A ComfyUI custom node integration for multi-engine multi-language Text-to-Speech and Voice...
wildminder/ComfyUI-VibeVoice
ComfyUI custom node for the VibeVoice TTS. Expressive, long-form, multi-speaker conversational audio
wildminder/ComfyUI-VoxCPM
ComfyUI node for highly expressive speech and realistic zero-shot voice cloning
1038lab/ComfyUI-EdgeTTS
ComfyUI-EdgeTTS is a powerful text-to-speech node for ComfyUI, leveraging Microsoft's Edge TTS...
eigenpunk/ComfyUI-audio
some generative audio tools for ComfyUI