wildminder/ComfyUI-VibeVoice

ComfyUI custom node for the VibeVoice TTS. Expressive, long-form, multi-speaker conversational audio

50
/ 100
Established

This tool helps content creators, podcasters, and educators generate natural-sounding, multi-speaker audio conversations from a written script. You provide a text dialogue and optionally some reference audio clips for specific voices, and it produces a single audio file with up to four distinct, expressive speakers. It's designed for anyone who needs high-quality, long-form conversational audio without recording multiple people.

563 stars. No commits in the last 6 months.

Use this if you need to create realistic spoken dialogue, such as for podcasts, audiobooks, or explainer videos, and want to control multiple distinct voices from a text script.

Not ideal if you only need single-speaker narration or extremely short audio snippets, or if you prefer to record voices manually.

podcasting audiobook creation content generation e-learning development dialogue synthesis
Stale 6m No Package No Dependents
Maintenance 2 / 25
Adoption 10 / 25
Maturity 15 / 25
Community 23 / 25

How are scores calculated?

Stars

563

Forks

105

Language

Python

License

MIT

Last pushed

Sep 25, 2025

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/wildminder/ComfyUI-VibeVoice"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.