VibeVoice-ComfyUI and ComfyUI-SparkTTS
These are complementary TTS nodes that can be used together in the same ComfyUI workflow to leverage different synthesis approaches—VibeVoice for multi-speaker voice cloning and SparkTTS for LLM-powered prosody generation.
About VibeVoice-ComfyUI
Enemyx-net/VibeVoice-ComfyUI
A comprehensive ComfyUI integration for Microsoft's VibeVoice text-to-speech model, enabling high-quality single and multi-speaker voice synthesis directly within your ComfyUI workflows.
This tool helps content creators and storytellers generate natural-sounding speech from text directly within their ComfyUI workflows. You provide written scripts, and it outputs high-quality audio, including options for single voices or dynamic multi-speaker conversations. It's designed for anyone needing realistic voiceovers, character dialogue, or narrated content.
About ComfyUI-SparkTTS
1038lab/ComfyUI-SparkTTS
ComfyUI-SparkTTS is a custom ComfyUI node implementation of SparkTTS, an advanced text-to-speech system that harnesses the power of large language models (LLMs) to generate highly accurate and natural-sounding speech.
This tool helps content creators, educators, or marketers generate natural-sounding speech from text using advanced AI. You provide text and, optionally, a reference audio sample, and it produces high-quality audio in a customized or cloned voice. It's designed for anyone needing realistic voiceovers or personalized audio content.
Related comparisons
Scores updated daily from GitHub, PyPI, and npm data. How scores work