TTS-Audio-Suite and ComfyUI-lethris-dia2

These are complements—the Audio Suite provides a unified multi-engine TTS framework that could incorporate Dia2 as an additional TTS backend option alongside its other supported models, allowing users to leverage Dia2's timestamp and caption generation capabilities within a broader voice synthesis workflow.

TTS-Audio-Suite
68
Established
ComfyUI-lethris-dia2
33
Emerging
Maintenance 25/25
Adoption 10/25
Maturity 15/25
Community 18/25
Maintenance 6/25
Adoption 1/25
Maturity 13/25
Community 13/25
Stars: 774
Forks: 71
Downloads:
Commits (30d): 79
Language: Python
License:
Stars: 1
Forks: 2
Downloads:
Commits (30d): 0
Language: Python
License: MIT
No Package No Dependents
No Package No Dependents

About TTS-Audio-Suite

diodiogod/TTS-Audio-Suite

A ComfyUI custom node integration for multi-engine multi-language Text-to-Speech and Voice Conversion. Supports: RVC, Echo-TTS, Qwen3-TTS, Cozy Voice 3, Step Audio EditX, IndexTTS-2, Chatterbox (classic and multilingual 23-lang), F5-TTS, Higgs Audio 2 and VibeVoice with unlimited text length, SRT timing, Character support, and many audio tools

This suite helps video producers, content creators, and educators quickly turn written scripts into natural-sounding speech across many languages and voices. You input your text, choose from various AI voices, and the system generates audio, complete with precise timing for subtitles. It's designed for anyone needing professional-grade voiceovers or narrated content without hiring voice actors.

video-production content-creation localization e-learning audio-narration

About ComfyUI-lethris-dia2

lord-lethris/ComfyUI-lethris-dia2

ComfyUI custom nodes for the Dia2 TTS model — generate speech, timestamps, and captions directly inside ComfyUI.

Scores updated daily from GitHub, PyPI, and npm data. How scores work