TTS-Audio-Suite and ComfyUI-SparkTTS

These are complementary tools that serve different TTS engines within the same ComfyUI workflow—TTS-Audio-Suite provides multi-engine flexibility including RVC and various TTS models, while SparkTTS specializes in LLM-powered speech synthesis, allowing users to combine them for different quality/latency trade-offs in a single project.

TTS-Audio-Suite
68
Established
ComfyUI-SparkTTS
41
Emerging
Maintenance 25/25
Adoption 10/25
Maturity 15/25
Community 18/25
Maintenance 2/25
Adoption 10/25
Maturity 16/25
Community 13/25
Stars: 774
Forks: 71
Downloads:
Commits (30d): 79
Language: Python
License:
Stars: 124
Forks: 13
Downloads:
Commits (30d): 0
Language: Python
License: GPL-3.0
No Package No Dependents
Stale 6m No Package No Dependents

About TTS-Audio-Suite

diodiogod/TTS-Audio-Suite

A ComfyUI custom node integration for multi-engine multi-language Text-to-Speech and Voice Conversion. Supports: RVC, Echo-TTS, Qwen3-TTS, Cozy Voice 3, Step Audio EditX, IndexTTS-2, Chatterbox (classic and multilingual 23-lang), F5-TTS, Higgs Audio 2 and VibeVoice with unlimited text length, SRT timing, Character support, and many audio tools

This suite helps video producers, content creators, and educators quickly turn written scripts into natural-sounding speech across many languages and voices. You input your text, choose from various AI voices, and the system generates audio, complete with precise timing for subtitles. It's designed for anyone needing professional-grade voiceovers or narrated content without hiring voice actors.

video-production content-creation localization e-learning audio-narration

About ComfyUI-SparkTTS

1038lab/ComfyUI-SparkTTS

ComfyUI-SparkTTS is a custom ComfyUI node implementation of SparkTTS, an advanced text-to-speech system that harnesses the power of large language models (LLMs) to generate highly accurate and natural-sounding speech.

This tool helps content creators, educators, or marketers generate natural-sounding speech from text using advanced AI. You provide text and, optionally, a reference audio sample, and it produces high-quality audio in a customized or cloned voice. It's designed for anyone needing realistic voiceovers or personalized audio content.

voice-generation audio-content-creation narration podcast-production e-learning

Scores updated daily from GitHub, PyPI, and npm data. How scores work