gokhaneraslan/chatterbox-finetuning

Fine-tuning toolkit for Chatterbox TTS & Chatterbox TURBO models. Supports 23 languages with smart vocabulary extension. Features offline preprocessing, automatic VAD trimming, and voice cloning capabilities. Train custom TTS models with your own dataset in LJSpeech and file-based format.

51
/ 100
Established

This toolkit helps you create custom text-to-speech (TTS) voices using your own audio recordings and text. You provide a dataset of recorded speech (like an LJSpeech dataset or individual audio and text files), and it produces a unique voice model that can generate high-quality speech in many languages. This is ideal for content creators, educators, or businesses needing a consistent, personalized voice for their digital content.

Use this if you need to train a custom text-to-speech voice from your own audio data, especially if you want to support specific languages or unique pronunciations.

Not ideal if you're looking for an out-of-the-box text-to-speech service that doesn't require training a new model.

voice-synthesis audio-content-creation localization digital-voice-cloning speech-technology
No Package No Dependents
Maintenance 10 / 25
Adoption 9 / 25
Maturity 13 / 25
Community 19 / 25

How are scores calculated?

Stars

84

Forks

21

Language

Python

License

Apache-2.0

Last pushed

Feb 20, 2026

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/gokhaneraslan/chatterbox-finetuning"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.