TTS-Audio-Suite and ComfyUI-VoxCPMTTS

These are complements that work together—TTS-Audio-Suite provides a unified multi-engine framework that can integrate VoxCPM-TTS as one of several available text-to-speech backends within a single ComfyUI workflow.

TTS-Audio-Suite

Established

ComfyUI-VoxCPMTTS

Emerging

Maintenance 25/25

Adoption 10/25

Maturity 15/25

Community 18/25

Maintenance 6/25

Adoption 7/25

Maturity 15/25

Community 8/25

Stars: 774

Forks: 71

Downloads: —

Commits (30d): 79

Language: Python

License: —

Stars: 36

Forks: 3

Downloads: —

Commits (30d): 0

Language: Python

License: GPL-3.0

No Package No Dependents

About TTS-Audio-Suite

diodiogod/TTS-Audio-Suite

A ComfyUI custom node integration for multi-engine multi-language Text-to-Speech and Voice Conversion. Supports: RVC, Echo-TTS, Qwen3-TTS, Cozy Voice 3, Step Audio EditX, IndexTTS-2, Chatterbox (classic and multilingual 23-lang), F5-TTS, Higgs Audio 2 and VibeVoice with unlimited text length, SRT timing, Character support, and many audio tools

This suite helps video producers, content creators, and educators quickly turn written scripts into natural-sounding speech across many languages and voices. You input your text, choose from various AI voices, and the system generates audio, complete with precise timing for subtitles. It's designed for anyone needing professional-grade voiceovers or narrated content without hiring voice actors.

video-production content-creation localization e-learning audio-narration

About ComfyUI-VoxCPMTTS

1038lab/ComfyUI-VoxCPMTTS

A clean, efficient ComfyUI custom node for VoxCPM TTS (Text-to-Speech) functionality. This implementation provides high-quality speech generation and voice cloning capabilities using the VoxCPM 1.5 model.

This tool helps content creators, marketers, and educators generate natural-sounding speech from text or clone existing voices for audio content. You provide written text and optionally a voice recording, and it produces high-quality audio narration or speech that mimics the cloned voice. It's designed for anyone needing realistic spoken audio without hiring voice talent or specialized recording equipment.

audio-content-creation voice-over digital-narration synthetic-media e-learning

Related comparisons

TTS-Audio-Suite and VibeVoice-ComfyUI TTS-Audio-Suite and ComfyUI-VibeVoice TTS-Audio-Suite and ComfyUI-VoxCPM TTS-Audio-Suite and ComfyUI-EdgeTTS TTS-Audio-Suite and ComfyUI-XTTS TTS-Audio-Suite and ComfyUI-Maya1_TTS

Scores updated daily from GitHub, PyPI, and npm data. How scores work