k2-fsa/sherpa

Speech-to-text server framework with next-gen Kaldi

69
/ 100
Established

This project helps you convert spoken audio into written text accurately and efficiently, leveraging advanced AI models. You feed it audio recordings, and it provides precise transcriptions. It's ideal for anyone who needs to quickly get text from speech, like transcribers, content creators, or accessibility developers.

896 stars. Actively maintained with 8 commits in the last 30 days.

Use this if you need a high-performance system to deploy pre-trained speech-to-text models and transcribe audio with end-to-end AI.

Not ideal if you're looking to train or fine-tune your own custom speech-to-text models; for that, you should explore other tools.

audio-transcription speech-to-text voice-processing audio-analysis accessibility-tech
No Package No Dependents
Maintenance 20 / 25
Adoption 10 / 25
Maturity 16 / 25
Community 23 / 25

How are scores calculated?

Stars

896

Forks

146

Language

C++

License

Apache-2.0

Last pushed

Mar 18, 2026

Commits (30d)

8

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/k2-fsa/sherpa"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.