thekartikeyamishra/VoiceCloner
The Voice Cloner is a Python-based project that leverages Tacotron 2 and WaveGlow models for text-to-speech (TTS) synthesis and basic voice cloning. This project supports 22 official Indian languages, including Sanskrit, making it versatile for multilingual text input.
It is aimed at content creators, educators, and communicators who want to turn written text into spoken audio in any of these languages or in English. You provide text, select a language, and receive an audio file. It can also mimic a speaker's voice to generate new audio with similar speech patterns.
No commits in the last 6 months.
Use this if you need to create audio narration for documents, educational materials, or marketing content in multiple Indian languages without recording a human voice.
Not ideal if you require advanced, high-fidelity voice cloning or a graphical user interface for ease of use, as this is a basic version with a command-line interface.
Stars: 8
Forks: 3
Language: Python
License: —
Category:
Last pushed: Dec 17, 2024
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/ml-frameworks/thekartikeyamishra/VoiceCloner"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
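For scripted use, the same request can be sketched in Python with only the standard library. This is a minimal sketch under stated assumptions: the helper names `build_quality_url` and `fetch_quality` are hypothetical, the reading of the path as category, owner, and repository is inferred from the curl URL above, and the response is assumed to be JSON (the page does not document the response format).

```python
import json
import urllib.parse
import urllib.request

# Base path taken from the curl example; everything after it is
# assumed to be <category>/<owner>/<repo>.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def build_quality_url(category: str, owner: str, repo: str) -> str:
    """Build the endpoint URL for one repository (hypothetical helper)."""
    parts = [urllib.parse.quote(p, safe="") for p in (category, owner, repo)]
    return BASE_URL + "/" + "/".join(parts)


def fetch_quality(category: str, owner: str, repo: str, timeout: float = 10.0):
    """Fetch the data for one repository.

    Assumption: the endpoint returns a JSON body. No API key is sent,
    so the keyless daily request limit applies.
    """
    url = build_quality_url(category, owner, repo)
    with urllib.request.urlopen(url, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))
```

Calling `fetch_quality("ml-frameworks", "thekartikeyamishra", "VoiceCloner")` issues the same GET request as the curl command. Because the keyless tier is limited to 100 requests per day, it may be worth caching the result locally rather than refetching on every run.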
Higher-rated alternatives
IS2AI/TurkicASR
A multilingual ASR model that can recognize ten Turkic languages: Azerbaijani, Bashkir, Chuvash,...
seanpm2001/Phoneticut
Phoneticut is a voice actor replacement: Make a certain amount of sounds, and have stitching and...
ammosu/qwen3-tts-voice-clone
A full-stack voice cloning web application powered by Qwen3-TTS. Clone any voice with 3-10...
sekalf/MioTTS-llama.cpp
Create fast, lightweight text-to-speech audio on your CPU with MioTTS-llama.cpp. Convert text to...
moaz11112/qwen3-tts-enhanced
Clone voices in seconds with Qwen3-TTS Enhanced. Enjoy local, GPU-powered multi-reference...