NeoKazuya/qwen3-tts-enhanced
Enhanced Qwen3-TTS voice cloning GUI with multi-reference samples, variation generation, and audio preprocessing.
This tool lets content creators, voice actors, and media producers clone a voice from a short audio sample. You provide a recording of the voice you want to replicate, and it generates synthetic speech in that voice, with multiple variations to choose from. It suits anyone who needs custom voiceovers or dialogue with specific vocal characteristics.
Use this if you need to clone an existing voice, generate new speech in that voice, and do it all offline on your own GPU.
Not ideal if you don't have an NVIDIA GPU with at least 8GB of VRAM or if you primarily work on macOS.
Stars: 17
Forks: 2
Language: Python
License: Apache-2.0
Category:
Last pushed: Feb 02, 2026
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/NeoKazuya/qwen3-tts-enhanced"
Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000 requests/day.
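The same request can be made from a script. The following is a minimal Python sketch using only the standard library. It assumes the URL path follows the pattern category/owner/repo, inferred from the single curl example above, and that the response body is JSON. Neither is confirmed by this page, and the function names build_quality_url and fetch_quality are hypothetical. How an API key is passed is not documented here, so the sketch covers only the keyless tier.

```python
import json
import urllib.request
from urllib.parse import quote

# Base path taken from the curl example; the segment layout after it
# (category/owner/repo) is an assumption inferred from that one URL.
API_BASE = "https://pt-edge.onrender.com/api/v1/quality"


def build_quality_url(category: str, owner: str, repo: str) -> str:
    """Build the endpoint URL, percent-encoding each path segment."""
    segments = [quote(part, safe="") for part in (category, owner, repo)]
    return API_BASE + "/" + "/".join(segments)


def fetch_quality(category: str, owner: str, repo: str, timeout: float = 10.0):
    """Send a GET request and parse the body as JSON (assumed format)."""
    url = build_quality_url(category, owner, repo)
    with urllib.request.urlopen(url, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))
```

Calling fetch_quality("voice-ai", "NeoKazuya", "qwen3-tts-enhanced") requests the same URL as the curl command. Inspect the returned object before relying on any field names, since the response schema is not described here.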
Higher-rated alternatives
pnnbao97/VieNeu-TTS
Vietnamese TTS with instant voice cloning • On-device • Real-time CPU inference • 24kHz audio...
CorentinJ/Real-Time-Voice-Cloning
Clone a voice in 5 seconds to generate arbitrary speech in real-time
babysor/MockingBird
🚀Clone a voice in 5 seconds to generate arbitrary speech in real-time
r9y9/nnmnkwii
Library to build speech synthesis systems designed for easy and fast prototyping.
Softcatala/open-dubbing
Open dubbing is an AI dubbing system which uses machine learning models to automatically...