Bassamejlaoui/Voice-Cloning-Translation-Transcription
Voice cloning replicates a specific human voice closely enough to synthesize new speech in it. It has the potential to change how people interact with each other and with machines.
This project helps content creators, educators, and customer service professionals generate realistic speech in a specific voice across multiple languages. You provide a short audio clip of a voice and some text, and it produces an audio file of that text spoken in the cloned voice. It also offers tools to transcribe audio into text and translate spoken content.
No commits in the last 6 months.
Use this if you need to create consistent voiceovers, localized audio content, or generate speech from text while maintaining a distinct vocal identity.
Not ideal if you're looking for a simple voice modulator for live calls or need real-time, bidirectional voice conversations.
Stars: 8
Forks: 2
Language: —
License: CC0-1.0
Category:
Last pushed: May 28, 2024
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/Bassamejlaoui/Voice-Cloning-Translation-Transcription"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
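The same request can be made from a script. The sketch below is a minimal Python equivalent of the curl call, using only the standard library. It assumes the path after `/quality/` is laid out as category, owner, repository (inferred from the one URL shown above) and that the response body is JSON. Neither is confirmed by this page, and the helper names `quality_url` and `fetch_quality` are made up for illustration.

```python
import json
import urllib.request
from urllib.parse import quote

# Base path copied from the curl example above.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def quality_url(category: str, owner: str, repo: str) -> str:
    """Build the endpoint URL.

    Assumption: the path is <category>/<owner>/<repo>, as inferred
    from the single example URL on this page.
    """
    parts = [quote(part, safe="") for part in (category, owner, repo)]
    return BASE_URL + "/" + "/".join(parts)


def fetch_quality(category: str, owner: str, repo: str, timeout: float = 10.0):
    """Fetch the record for one repository.

    Assumption: the endpoint returns a JSON body; its fields are not
    documented here, so the decoded object is returned unchanged.
    """
    url = quality_url(category, owner, repo)
    with urllib.request.urlopen(url, timeout=timeout) as resp:
        return json.load(resp)


if __name__ == "__main__":
    # Only prints the URL; call fetch_quality(...) to hit the network.
    print(quality_url(
        "voice-ai",
        "Bassamejlaoui",
        "Voice-Cloning-Translation-Transcription",
    ))
```

Each call to `fetch_quality` counts against the daily request limit, so cache results locally if you query many repositories.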
Higher-rated alternatives
travisvn/chatterbox-tts-api
Local, OpenAI-compatible text-to-speech (TTS) API using Chatterbox, enabling users to generate...
FunAudioLLM/CosyVoice
Multi-lingual large voice generation model, providing inference, training and deployment...
fishaudio/Bert-VITS2
vits2 backbone with multilingual-bert
sfortis/openai_tts
Custom TTS component for Home Assistant. Utilizes the OpenAI speech engine or any compatible...
OpenMOSS/MOSS-TTSD
MOSS-TTSD is a spoken dialogue generation model designed for expressive multi-speaker synthesis....