Enemyx-net/VibeVoice-ComfyUI

A comprehensive ComfyUI integration for Microsoft's VibeVoice text-to-speech model, enabling high-quality single- and multi-speaker voice synthesis directly within your ComfyUI workflows.

58 / 100 (Established)

This tool helps content creators and storytellers generate natural-sounding speech from text directly within their ComfyUI workflows. You provide written scripts, and it outputs high-quality audio, including options for single voices or dynamic multi-speaker conversations. It's designed for anyone needing realistic voiceovers, character dialogue, or narrated content.

Use this if you need to create realistic spoken audio from text for videos, podcasts, or interactive experiences, and want fine-grained control over voices and speech characteristics.

Not ideal if you primarily need to transcribe existing audio to text, or if you require extremely lightweight, simple text-to-speech without advanced features like voice cloning or multi-speaker support.

content-creation voiceover storytelling audio-production digital-media
No Package, No Dependents
Maintenance 10 / 25
Adoption 10 / 25
Maturity 15 / 25
Community 23 / 25

Stars: 1,391
Forks: 219
Language: Python
License: MIT
Last pushed: Feb 18, 2026
Commits (30d): 0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/Enemyx-net/VibeVoice-ComfyUI"

Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000 requests/day.
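
The same request can be made from a script. The following is a minimal Python sketch, not an official client. It assumes the URL path is built from a category segment (`voice-ai` here), the repository owner, and the repository name, as the curl example suggests. The helper names `build_quality_url` and `fetch_quality` are hypothetical, and the response format is not documented on this page, so the sketch tries JSON and falls back to raw text.

```python
import json
import urllib.request

# Base path copied from the curl example above.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def build_quality_url(category: str, owner: str, repo: str) -> str:
    """Join the path segments seen in the curl example into a full URL.

    The meaning of the first segment ("category") is an assumption.
    """
    return f"{BASE_URL}/{category}/{owner}/{repo}"


def fetch_quality(url: str, timeout: float = 10.0):
    """GET the URL without an API key.

    Returns parsed JSON when the body is valid JSON (an assumption about
    the response format), otherwise the raw response text.
    """
    with urllib.request.urlopen(url, timeout=timeout) as resp:
        body = resp.read().decode("utf-8", errors="replace")
    try:
        return json.loads(body)
    except json.JSONDecodeError:
        return body


if __name__ == "__main__":
    url = build_quality_url("voice-ai", "Enemyx-net", "VibeVoice-ComfyUI")
    print(fetch_quality(url))
```

Since keyless access is limited to 100 requests/day, it may be worth caching the result locally rather than fetching on every run.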