gokulkarthik/text2speech

Towards Building Text-To-Speech Systems for the Next Billion Users - Microsoft Research Intern Work - Accepted at ICASSP 2023

/ 100

Emerging

This project offers advanced text-to-speech models specifically for 13 Indian languages like Hindi, Tamil, and Bengali. It takes written text as input and generates natural-sounding spoken audio. Businesses, educators, and content creators aiming to produce localized audio content for an Indian audience will find this beneficial.

No commits in the last 6 months.

Use this if you need high-quality, natural-sounding voiceovers or audio output in multiple Indian languages for applications, educational materials, or content.

Not ideal if your primary need is for languages other than the 13 specified Indian languages, or if you require real-time, low-latency speech generation in extremely resource-constrained environments.

audio-content-creation localization e-learning accessibility voice-assistants

No License Stale 6m No Package No Dependents

Maintenance 0 / 25

Adoption 8 / 25

Maturity 8 / 25

Community 14 / 25

How are scores calculated?

Stars

Forks

Language

Jupyter Notebook

License

—

Higher-rated alternatives

SYSTRAN/faster-whisper

Faster Whisper transcription with CTranslate2

machinelearningZH/audio-transcription

Transcribe any audio or video file. Edit and view your transcripts in a standalone HTML editor.

saharmor/whisper-playground

Build real time speech2text web apps using OpenAI's Whisper https://openai.com/blog/whisper/

shhossain/BanglaSpeech2Text

BanglaSpeech2Text: An open-source offline speech-to-text package for Bangla language. Fine-tuned...

oseiskar/autosubsync

Automatically synchronize subtitles with audio using machine learning

Explore Voice AI Tools

All categories Trending Voice AI directory Insights