wildminder/ComfyUI-VoxCPM
ComfyUI node for highly expressive speech and realistic zero-shot voice cloning
This tool helps content creators, podcasters, or marketing professionals generate highly realistic speech from text. You provide text and, optionally, a short audio sample of a voice, and it outputs an audio file with the text spoken in that voice, complete with natural expression and tone. It's designed for anyone needing expressive, true-to-life voiceovers or cloned voices for various media.
390 stars.
Use this if you need to create natural-sounding voiceovers, clone a specific voice for consistent audio content, or generate expressive spoken dialogue from written scripts.
Not ideal if you need simple, low-fidelity text-to-speech for quick drafts, or if your project must run very fast on basic hardware and does not need advanced voice cloning.
Stars: 390
Forks: 42
Language: Python
License: Apache-2.0
Category:
Last pushed: Dec 17, 2025
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/wildminder/ComfyUI-VoxCPM"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
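The same request can be made from Python. The following is a minimal sketch using only the standard library, not a documented client. It assumes the three path segments after `/quality/` are category, owner, and repository name (inferred from the curl URL above), and that the response body is JSON; neither is confirmed by this page. The names `quality_url` and `fetch_quality` are hypothetical helpers introduced for illustration.

```python
import json
import urllib.request
from urllib.parse import quote

# Base path copied from the curl command above.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def quality_url(category: str, owner: str, repo: str) -> str:
    """Build the endpoint URL in the same shape as the curl example.

    Assumption: the path is /<category>/<owner>/<repo>.
    """
    parts = (quote(part, safe="") for part in (category, owner, repo))
    return f"{BASE_URL}/" + "/".join(parts)


def fetch_quality(category: str, owner: str, repo: str, timeout: float = 10.0):
    """GET the endpoint and decode the body.

    Assumption: the body is JSON. Its fields are not described on this
    page, so the decoded object is returned as-is.
    """
    url = quality_url(category, owner, repo)
    with urllib.request.urlopen(url, timeout=timeout) as resp:
        return json.loads(resp.read().decode("utf-8"))


if __name__ == "__main__":
    # Prints the URL only; call fetch_quality(...) to make a real request.
    print(quality_url("voice-ai", "wildminder", "ComfyUI-VoxCPM"))
```

Calling `fetch_quality("voice-ai", "wildminder", "ComfyUI-VoxCPM")` issues one request, which counts against the 100 requests/day keyless limit mentioned above, so cache the result if you poll regularly.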
Higher-rated alternatives
diodiogod/TTS-Audio-Suite
A ComfyUI custom node integration for multi-engine multi-language Text-to-Speech and Voice...
Enemyx-net/VibeVoice-ComfyUI
A comprehensive ComfyUI integration for Microsoft's VibeVoice text-to-speech model, enabling...
wildminder/ComfyUI-VibeVoice
ComfyUI custom node for the VibeVoice TTS. Expressive, long-form, multi-speaker conversational audio
1038lab/ComfyUI-EdgeTTS
ComfyUI-EdgeTTS is a powerful text-to-speech node for ComfyUI, leveraging Microsoft's Edge TTS...
eigenpunk/ComfyUI-audio
some generative audio tools for ComfyUI