gokulkarthik/text2speech
Towards Building Text-To-Speech Systems for the Next Billion Users - Microsoft Research Intern Work - Accepted at ICASSP 2023
This project provides text-to-speech models for 13 Indian languages, including Hindi, Tamil, and Bengali. It takes written text as input and generates natural-sounding spoken audio. It is aimed at businesses, educators, and content creators who need localized audio content for an Indian audience.
No commits in the last 6 months.
Use this if you need high-quality, natural-sounding voiceovers or audio output in multiple Indian languages for applications, educational materials, or content.
Not ideal if your primary need is for languages other than the 13 specified Indian languages, or if you require real-time, low-latency speech generation in extremely resource-constrained environments.
Stars: 57
Forks: 8
Language: Jupyter Notebook
License: —
Category:
Last pushed: May 07, 2023
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/gokulkarthik/text2speech"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
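The same request can be made from a script. The following is a minimal sketch using only the Python standard library; the function names `build_quality_url` and `fetch_quality` are hypothetical, the `voice-ai` path segment is taken from the curl URL above, and the code assumes the endpoint returns a JSON body, which this page does not confirm.

```python
import json
import urllib.request
from urllib.parse import quote

# Base path copied from the curl example; everything else here is illustrative.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def build_quality_url(category: str, repo: str) -> str:
    """Build the endpoint URL for a repo given as 'owner/name'."""
    # Escape each part, but keep the slash between owner and repository name.
    return f"{BASE_URL}/{quote(category, safe='')}/{quote(repo, safe='/')}"


def fetch_quality(category: str, repo: str, timeout: float = 10.0):
    """Fetch the quality record. Assumes (unverified) a JSON response body."""
    url = build_quality_url(category, repo)
    with urllib.request.urlopen(url, timeout=timeout) as resp:
        return json.loads(resp.read().decode("utf-8"))


if __name__ == "__main__":
    # Anonymous access is limited to 100 requests/day, so avoid calling in a loop.
    print(fetch_quality("voice-ai", "gokulkarthik/text2speech"))
```

Because the anonymous quota is 100 requests/day, it is probably worth caching the result locally if the data is needed more than occasionally.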
Higher-rated alternatives
SYSTRAN/faster-whisper
Faster Whisper transcription with CTranslate2
machinelearningZH/audio-transcription
Transcribe any audio or video file. Edit and view your transcripts in a standalone HTML editor.
saharmor/whisper-playground
Build real time speech2text web apps using OpenAI's Whisper https://openai.com/blog/whisper/
shhossain/BanglaSpeech2Text
BanglaSpeech2Text: An open-source offline speech-to-text package for Bangla language. Fine-tuned...
oseiskar/autosubsync
Automatically synchronize subtitles with audio using machine learning