Chatterbox-TTS-Server and Dia-TTS-Server

These are ecosystem siblings: both are self-hosted TTS servers built by the same developer on different underlying models (Chatterbox and Dia). They are alternatives rather than complements, so users pick whichever model better suits their use case instead of running the two together.

                 Chatterbox-TTS-Server        Dia-TTS-Server
Status           Verified                     Emerging
Maintenance      20/25                        2/25
Adoption         10/25                        10/25
Maturity         15/25                        15/25
Community        25/25                        21/25
Stars            1,101                        346
Forks            267                          63
Downloads        —                            —
Commits (30d)    23                           0
Language         Python                       Python
License          MIT                          MIT
Badges           No Package, No Dependents    Stale 6m, No Package, No Dependents

About Chatterbox-TTS-Server

devnen/Chatterbox-TTS-Server

Self-host the powerful Chatterbox TTS model. This server offers a user-friendly Web UI, flexible API endpoints (incl. OpenAI compatible), predefined voices, voice cloning, and large audiobook-scale text processing. Runs accelerated on NVIDIA (CUDA), AMD (ROCm), and CPU.

This tool helps you convert written text into high-quality spoken audio using various voices and languages. You provide text and, optionally, a voice to clone, and it outputs realistic speech, even for long documents like audiobooks. It's designed for content creators, marketers, educators, and anyone needing to generate expressive voiceovers or audio content.

audiobook-production voiceover-creation e-learning-content multilingual-communication podcast-generation
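As a rough illustration of the OpenAI-compatible endpoint mentioned above, the sketch below builds a request in the shape of OpenAI's speech API (`POST /v1/audio/speech` with `model`, `input`, `voice`, and `response_format` fields). The base URL and port, the model name, and the voice file name are assumptions made for this example, not values confirmed by the project; check the server's own documentation for the real ones.

```python
import json
import urllib.request

# Assumption: the server runs locally on this port; adjust to your setup.
BASE_URL = "http://localhost:8004"


def build_speech_request(text, voice, base_url=BASE_URL, response_format="wav"):
    """Build (but do not send) a POST request in the OpenAI speech-API shape."""
    payload = {
        "model": "chatterbox",  # hypothetical model name, for illustration only
        "input": text,
        "voice": voice,  # hypothetical voice identifier
        "response_format": response_format,
    }
    return urllib.request.Request(
        url=f"{base_url}/v1/audio/speech",
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def synthesize(text, voice, out_path):
    """Send the request and write the returned audio bytes to out_path."""
    request = build_speech_request(text, voice)
    with urllib.request.urlopen(request) as response:
        audio = response.read()
    with open(out_path, "wb") as handle:
        handle.write(audio)
    return out_path
```

With a server running, `synthesize("Chapter one.", "narrator.wav", "chapter1.wav")` would save the returned audio. Keeping request construction separate from sending makes the payload easy to inspect before any network call.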

About Dia-TTS-Server

devnen/Dia-TTS-Server

Self-host the powerful Dia TTS model. This server offers a user-friendly Web UI, flexible API endpoints (incl. OpenAI compatible), support for SafeTensors/BF16, voice cloning, dialogue generation, and GPU/CPU execution.

This project provides a straightforward way to turn written text into natural-sounding speech, especially for realistic conversations. You input text, optionally with speaker labels or example audio, and it produces high-quality audio recordings. Voice actors, content creators, and educators can use this to generate spoken content efficiently.

voice-generation content-creation dialogue-synthesis audiobook-production e-learning
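The speaker-labelled input described above can be prepared with a small helper. This sketch assumes the `[S1]` / `[S2]` tag convention used by the Dia model and a two-speaker limit; the function name and the limit are choices made for this example, so verify the expected format against the server's documentation.

```python
def format_dialogue(turns, max_speakers=2):
    """Turn (speaker, text) pairs into one tagged script string.

    Speakers are assigned tags [S1], [S2], ... in order of first appearance.
    The tag format and the two-speaker default are assumptions.
    """
    tags = {}
    parts = []
    for speaker, text in turns:
        if speaker not in tags:
            if len(tags) >= max_speakers:
                raise ValueError(f"more than {max_speakers} speakers in dialogue")
            tags[speaker] = f"[S{len(tags) + 1}]"
        parts.append(f"{tags[speaker]} {text.strip()}")
    return " ".join(parts)


script = format_dialogue([
    ("Ann", "Hi there."),
    ("Bob", "Hello!"),
    ("Ann", "How are you?"),
])
print(script)
```

The resulting string could then be sent as the text input of a synthesis request.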

Related comparisons

Chatterbox-TTS-Server and voicebox
Chatterbox-TTS-Server and xtts-api-server
Chatterbox-TTS-Server and ChatTTS-ui
Chatterbox-TTS-Server and chatterbox-finetuning
Chatterbox-TTS-Server and vox-box
Chatterbox-TTS-Server and Chatterbox-TTS-Extended

Scores updated daily from GitHub, PyPI, and npm data.