kroko-ai/kroko-onnx

Kroko ASR - Speech-to-text

/ 100

Emerging

This project provides an open-source solution for converting spoken language into written text using speech recognition. It takes audio input and outputs a transcript, making it useful for developers building applications that need to process spoken words. The output is a JSON-formatted text, including partial or final transcripts, segment details, and word-level timestamps. This tool is designed for software developers who need to integrate high-quality, fast speech-to-text capabilities into their applications.

138 stars. No commits in the last 6 months.

Use this if you are a software developer building applications on Android, web browsers, or servers, and you need to integrate fast, accurate speech-to-text functionality.

Not ideal if you are an end-user looking for a ready-to-use application, as this project requires development knowledge for integration.

speech-to-text voice-recognition application-development audio-processing real-time-transcription

Stale 6m No Package No Dependents

Maintenance 2 / 25

Adoption 10 / 25

Maturity 15 / 25

Community 11 / 25

How are scores calculated?

Stars

138

Forks

Language

C++

License

Apache-2.0

Higher-rated alternatives

PaddlePaddle/PaddleSpeech

Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with...

k2-fsa/sherpa

Speech-to-text server framework with next-gen Kaldi

Picovoice/cheetah

On-device streaming speech-to-text engine powered by deep learning

yeyupiaoling/YeAudio

Python的音频工具

zaigie/FunSpeech

开箱即用的本地私有化部署语音服务，快速搭建FunASR与CosyVoice2/3后端

Explore Voice AI Tools

All categories Trending Voice AI directory Insights