Onuronon-lab/Shrutik

Open-source voice data collection platform for building inclusive voice datasets. Collaborative transcription with quality consensus. FastAPI + React + PostgreSQL.

/ 100

Emerging

This platform helps communities create high-quality voice datasets for underrepresented languages. Native speakers record and transcribe their voices, which the platform processes to build inclusive voice technology. This is ideal for linguistic communities, researchers, and organizations aiming to develop speech recognition systems for regional or minority languages.

Use this if you want to gather, transcribe, and validate voice recordings from a community to build speech technology for a language that current systems don't support well.

Not ideal if you need a pre-built speech recognition engine or a platform for general audio transcription unrelated to voice dataset creation.

linguistic-diversity voice-technology language-preservation crowdsourcing speech-dataset-creation

No Package No Dependents

Maintenance 10 / 25

Adoption 5 / 25

Maturity 13 / 25

Community 17 / 25

How are scores calculated?

Stars

Forks

Language

Python

License

—

Higher-rated alternatives

voicegain/platform

Voicegain Enterprise Speech-to-Text Platform (API, Portal, etc.)

aws-samples/amazon-transcribe-live-call-analytics

Amazon Transcribe Live Call Analytics (LCA) Sample Solution

SamirPaulb/real-time-voice-translator

A desktop application that uses AI to translate voice between languages in real time, while...

davidamacey/OpenTranscribe

Self-hosted AI-powered transcription platform with speaker diarization, search, and...

jim-schwoebel/voicebook

🗣️ A book and repo to get you started programming voice computing applications in Python (10...

Explore Voice AI Tools

All categories Trending Voice AI directory Insights