metacore-stack/modular-auto-specch-recog-toolkit
Building a modular, open-source toolkit that advances automatic speech recognition through reproducible workflows and community collaboration.
This toolkit helps machine learning engineers and researchers build, train, and deploy custom automatic speech recognition (ASR) systems. It takes raw audio files and corresponding transcripts to train ASR models, which then convert new audio into text. It's designed for individuals working on developing and improving speech-to-text technologies.
Use this if you are developing new automatic speech recognition models or need to fine-tune existing ones for specific datasets and applications.
Not ideal if you're looking for a ready-to-use, off-the-shelf speech-to-text API without needing to build or customize the underlying models.
Stars
8
Forks
2
Language
Python
License
AGPL-3.0
Category
Last pushed
Nov 12, 2025
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/ml-frameworks/metacore-stack/modular-auto-specch-recog-toolkit"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
felixbur/nkululeko
Machine learning speaker characteristics
claritychallenge/clarity
Clarity Challenge toolkit - software for building Clarity Challenge systems
juanmc2005/diart
A python package to build AI-powered real-time audio applications
astorfi/3D-convolutional-speaker-recognition
:speaker: Deep Learning & 3D Convolutional Neural Networks for Speaker Verification
wq2012/awesome-diarization
A curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources.