Onuronon-lab/Shrutik
Open-source voice data collection platform for building inclusive voice datasets. Collaborative transcription with quality consensus. FastAPI + React + PostgreSQL.
This platform helps communities create high-quality voice datasets for underrepresented languages. Native speakers record and transcribe their voices, which the platform processes to build inclusive voice technology. This is ideal for linguistic communities, researchers, and organizations aiming to develop speech recognition systems for regional or minority languages.
Use this if you want to gather, transcribe, and validate voice recordings from a community to build speech technology for a language that current systems don't support well.
Not ideal if you need a pre-built speech recognition engine or a platform for general audio transcription unrelated to voice dataset creation.
Stars
11
Forks
8
Language
Python
License
—
Category
Last pushed
Mar 10, 2026
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/Onuronon-lab/Shrutik"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
voicegain/platform
Voicegain Enterprise Speech-to-Text Platform (API, Portal, etc.)
aws-samples/amazon-transcribe-live-call-analytics
Amazon Transcribe Live Call Analytics (LCA) Sample Solution
SamirPaulb/real-time-voice-translator
A desktop application that uses AI to translate voice between languages in real time, while...
davidamacey/OpenTranscribe
Self-hosted AI-powered transcription platform with speaker diarization, search, and...
jim-schwoebel/voicebook
🗣️ A book and repo to get you started programming voice computing applications in Python (10...