thekartikeyamishra/VoiceCloner

Voice Cloner is a Python project that uses Tacotron 2 and WaveGlow models for text-to-speech (TTS) synthesis and basic voice cloning. It accepts text input in 22 official Indian languages, including Sanskrit.

Experimental

This tool helps content creators, educators, or communicators quickly turn written text into spoken audio in 22 official Indian languages, including Sanskrit, as well as English. You provide text, select a language, and receive an audio file. It can also mimic a speaker's voice to generate new audio with similar speech patterns.

No commits in the last 6 months.

Use this if you need to create audio narration for documents, educational materials, or marketing content in multiple Indian languages without recording a human voice.

Not ideal if you need high-fidelity voice cloning or a graphical user interface: this is a basic version with a command-line interface.

content-creation e-learning multilingual-communication audio-narration digital-publishing

No License · Stale 6m · No Package · No Dependents

Maintenance 0 / 25

Adoption 4 / 25

Maturity 8 / 25

Community 14 / 25

Language: Python

License: none

Higher-rated alternatives

IS2AI/TurkicASR

A multilingual ASR model that can recognize ten Turkic languages—Azerbaijani, Bashkir, Chuvash,...

seanpm2001/Phoneticut

Phoneticut is a voice actor replacement: Make a certain amount of sounds, and have stitching and...

ammosu/qwen3-tts-voice-clone

A full-stack voice cloning web application powered by Qwen3-TTS. Clone any voice with 3-10...

sekalf/MioTTS-llama.cpp

Create fast, lightweight text-to-speech audio on your CPU with MioTTS-llama.cpp. Convert text to...

moaz11112/qwen3-tts-enhanced

🎤 Clone voices in seconds with Qwen3-TTS Enhanced. Enjoy local, GPU-powered multi-reference...
