k2-fsa/sherpa

Speech-to-text server framework with next-gen Kaldi

/ 100

Established

This project helps you convert spoken audio into written text accurately and efficiently, leveraging advanced AI models. You feed it audio recordings, and it provides precise transcriptions. It's ideal for anyone who needs to quickly get text from speech, like transcribers, content creators, or accessibility developers.

896 stars. Actively maintained with 8 commits in the last 30 days.

Use this if you need a high-performance system to deploy pre-trained speech-to-text models and transcribe audio with end-to-end AI.

Not ideal if you're looking to train or fine-tune your own custom speech-to-text models; for that, you should explore other tools.

audio-transcription speech-to-text voice-processing audio-analysis accessibility-tech

No Package No Dependents

Maintenance 20 / 25

Adoption 10 / 25

Maturity 16 / 25

Community 23 / 25

How are scores calculated?

Stars

896

Forks

146

Language

C++

License

Apache-2.0

Related tools

PaddlePaddle/PaddleSpeech

Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with...

Picovoice/cheetah

On-device streaming speech-to-text engine powered by deep learning

yeyupiaoling/YeAudio

Python的音频工具

zaigie/FunSpeech

开箱即用的本地私有化部署语音服务，快速搭建FunASR与CosyVoice2/3后端

manyeyes/ManySpeech

AI Speech Solutions for Tasks such as ASR, Vocal Extraction, Accompaniment Extraction, Audio...

Explore Voice AI Tools

All categories Trending Voice AI directory Insights