mozi1924/Qwen3-TTS-EasyFinetuning

Easy fine-tuning for Qwen3-TTS: Fast voice cloning and high-quality multilingual speech synthesis.

37
/ 100
Emerging

This tool helps you create high-quality, custom voice models from your own audio recordings. You provide raw audio files of a speaker, and it produces a voice model that can speak any text with that speaker's unique voice and natural expression, even in different languages. This is ideal for content creators, game developers, or anyone needing consistent, branded voiceovers.

Use this if you need to generate production-quality, stable, and expressive speech in a specific voice, including cross-lingual synthesis without accents.

Not ideal if you only need quick, experimental voice cloning and don't require the highest quality or fine-tuned expressiveness.

voice-cloning speech-synthesis audio-production content-creation multilingual-voiceover
No Package No Dependents
Maintenance 10 / 25
Adoption 7 / 25
Maturity 11 / 25
Community 9 / 25

How are scores calculated?

Stars

32

Forks

3

Language

Python

License

Apache-2.0

Category

llm-fine-tuning

Last pushed

Mar 01, 2026

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/mozi1924/Qwen3-TTS-EasyFinetuning"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.