1038lab/ComfyUI-FireRedTTS

A ComfyUI integration for FireRedTTS‑2, a real-time multi-speaker TTS system enabling high-quality, emotionally expressive dialogue and monologue synthesis. Leveraging a streaming architecture and context-aware prosody modeling, it supports natural speaker turns and stable long-form generation, ideal for interactive chat and podcast applications.

32
/ 100
Emerging

This tool helps content creators, podcasters, and educators turn written scripts into natural-sounding speech, featuring multiple speakers if needed. You provide text, optionally with reference audio for specific voices, and it generates high-quality, emotionally expressive audio files. It's ideal for anyone creating spoken content without needing professional voice actors.

No commits in the last 6 months.

Use this if you need to quickly generate realistic, multi-speaker dialogue or single-speaker narration for podcasts, audiobooks, e-learning modules, or interactive chat applications, and you want to potentially clone specific voices.

Not ideal if you require a very small file size for the underlying model or are working with extremely limited computing resources without a dedicated GPU.

podcasting audiobook-production e-learning-content voice-over virtual-assistant-dialogue
Stale 6m No Package No Dependents
Maintenance 2 / 25
Adoption 7 / 25
Maturity 15 / 25
Community 8 / 25

How are scores calculated?

Stars

41

Forks

3

Language

Python

License

GPL-3.0

Last pushed

Sep 16, 2025

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/1038lab/ComfyUI-FireRedTTS"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.