ComfyUI-VibeVoice and ComfyUI-VoxCPMTTS
These are complementary TTS nodes for ComfyUI that take different synthesis approaches: VibeVoice excels at expressive, long-form conversational audio, while VoxCPM specializes in voice cloning. Users can pick whichever best fits their speech generation needs.
About ComfyUI-VibeVoice
wildminder/ComfyUI-VibeVoice
ComfyUI custom node for the VibeVoice TTS. Expressive, long-form, multi-speaker conversational audio
This tool helps content creators, podcasters, and educators generate natural-sounding, multi-speaker audio conversations from a written script. You provide a text dialogue and optionally some reference audio clips for specific voices, and it produces a single audio file with up to four distinct, expressive speakers. It's designed for anyone who needs high-quality, long-form conversational audio without recording multiple people.
About ComfyUI-VoxCPMTTS
1038lab/ComfyUI-VoxCPMTTS
A clean, efficient ComfyUI custom node for VoxCPM TTS (Text-to-Speech) functionality. This implementation provides high-quality speech generation and voice cloning capabilities using the VoxCPM 1.5 model.
This tool helps content creators, marketers, and educators generate natural-sounding speech from text or clone existing voices for audio content. You provide written text and optionally a voice recording, and it produces high-quality narration, either in a default voice or in one that mimics the recording. It's designed for anyone who needs realistic spoken audio without hiring voice talent or using specialized recording equipment.