thekartikeyamishra/VoiceCloner

The Voice Cloner is a Python-based project that leverages Tacotron 2 and WaveGlow models for text-to-speech (TTS) synthesis and basic voice cloning. This project supports 22 official Indian languages, including Sanskrit, making it versatile for multilingual text input.

Score: 26 / 100 (Experimental)

This tool helps content creators, educators, or communicators quickly turn written text into spoken audio in 22 official Indian languages, including Sanskrit, as well as English. You provide text, select a language, and receive an audio file. It can also mimic a speaker's voice to generate new audio with similar speech patterns.

No commits in the last 6 months.

Use this if you need to create audio narration for documents, educational materials, or marketing content in multiple Indian languages without recording a human voice.

Not ideal if you need advanced, high-fidelity voice cloning or a graphical user interface; this is a basic version driven from the command line.

content-creation e-learning multilingual-communication audio-narration digital-publishing
No License, Stale 6m, No Package, No Dependents
Maintenance 0 / 25
Adoption 4 / 25
Maturity 8 / 25
Community 14 / 25
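
The page does not state how the overall score is derived, but the four category scores listed above add up to the headline figure of 26 / 100. A minimal sketch of that arithmetic, assuming the total is a plain unweighted sum (the dictionary and variable names are illustrative only):

```python
# Category scores as listed on this page: (points earned, points available).
# Assumption: the overall score is the unweighted sum of the four categories.
category_scores = {
    "Maintenance": (0, 25),
    "Adoption": (4, 25),
    "Maturity": (8, 25),
    "Community": (14, 25),
}

total = sum(earned for earned, _ in category_scores.values())
max_total = sum(available for _, available in category_scores.values())

print(f"{total} / {max_total}")
```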


Stars: 8
Forks: 3
Language: Python
License: None
Last pushed: Dec 17, 2024
Commits (30d): 0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/ml-frameworks/thekartikeyamishra/VoiceCloner"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
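
The same request can be made from Python. The sketch below uses only the standard library and rebuilds the URL shown in the curl command; the helper names `build_quality_url` and `fetch_quality` are hypothetical, and the response is assumed to be JSON, which this page does not confirm.

```python
import json
import urllib.request
from urllib.parse import quote

# Base path taken from the curl example on this page.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def build_quality_url(owner, repo, category="ml-frameworks"):
    """Build the quality endpoint URL for a repository (hypothetical helper)."""
    # Percent-encode each path segment so unusual characters stay inside it.
    parts = [quote(part, safe="") for part in (category, owner, repo)]
    return BASE_URL + "/" + "/".join(parts)


def fetch_quality(owner, repo, category="ml-frameworks", timeout=10.0):
    """Fetch and decode the quality data (hypothetical helper).

    Assumption: the endpoint returns a JSON body. The field names are not
    documented on this page, so the decoded object is returned as-is.
    """
    url = build_quality_url(owner, repo, category)
    request = urllib.request.Request(url, headers={"Accept": "application/json"})
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))
```

Calling `fetch_quality("thekartikeyamishra", "VoiceCloner")` issues one request, which counts against the 100 requests/day keyless limit mentioned above.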