kroko-ai/kroko-onnx
Kroko ASR - Speech-to-text
This project provides an open-source solution for converting spoken language into written text using speech recognition. It takes audio input and outputs a transcript, making it useful for developers building applications that need to process spoken words. The output is a JSON-formatted text, including partial or final transcripts, segment details, and word-level timestamps. This tool is designed for software developers who need to integrate high-quality, fast speech-to-text capabilities into their applications.
138 stars. No commits in the last 6 months.
Use this if you are a software developer building applications on Android, web browsers, or servers, and you need to integrate fast, accurate speech-to-text functionality.
Not ideal if you are an end-user looking for a ready-to-use application, as this project requires development knowledge for integration.
Stars
138
Forks
10
Language
C++
License
Apache-2.0
Category
Last pushed
Oct 07, 2025
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/kroko-ai/kroko-onnx"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
PaddlePaddle/PaddleSpeech
Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with...
k2-fsa/sherpa
Speech-to-text server framework with next-gen Kaldi
Picovoice/cheetah
On-device streaming speech-to-text engine powered by deep learning
yeyupiaoling/YeAudio
Python的音频工具
zaigie/FunSpeech
开箱即用的本地私有化部署语音服务,快速搭建FunASR与CosyVoice2/3后端