ComfyUI-VibeVoice and ComfyUI-EdgeTTS
About ComfyUI-VibeVoice
wildminder/ComfyUI-VibeVoice
ComfyUI custom node for the VibeVoice TTS. Expressive, long-form, multi-speaker conversational audio
This tool helps content creators, podcasters, and educators generate natural-sounding, multi-speaker audio conversations from a written script. You provide a text dialogue and optionally some reference audio clips for specific voices, and it produces a single audio file with up to four distinct, expressive speakers. It's designed for anyone who needs high-quality, long-form conversational audio without recording multiple people.
About ComfyUI-EdgeTTS
1038lab/ComfyUI-EdgeTTS
ComfyUI-EdgeTTS is a powerful text-to-speech node for ComfyUI, leveraging Microsoft's Edge TTS capabilities. It enables seamless conversion of text into natural-sounding speech, supporting multiple languages and voices. Ideal for enhancing user interactions, this node is easy to integrate and customize, making it perfect for various applications.
This tool helps create natural-sounding speech from text and transcribe spoken audio into text. You provide written text in various languages and choose from many voices, or upload an audio file. The tool then produces a spoken audio file or a written transcript. It's designed for content creators, educators, or anyone needing to generate voiceovers or analyze spoken content.
Related comparisons
Scores updated daily from GitHub, PyPI, and npm data. How scores work