chenkui164/FastASR

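This is a C++ project for ASR inference. It has few dependencies, is simple to install, runs inference quickly, and runs smoothly on ARM platforms such as the Raspberry Pi 4B. The supported models are optimized from Google's Transformer model and trained on the open-source wenetspeech dataset (10,000+ hours) or Alibaba's private dataset (60,000+ hours), so recognition quality is also good and comparable to many commercial ASR products.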


Emerging

This project converts Mandarin Chinese speech, and some English speech, to text. It is designed for processing audio quickly on embedded systems and single-board computers without relying on a GPU, which suits developers and engineers building applications that need efficient local transcription.

547 stars. No commits in the last 6 months.

Use this if you need to integrate fast, accurate Chinese and English speech recognition directly into your applications, especially on resource-constrained hardware like Raspberry Pi.

Not ideal if you need a pre-built, end-user application for speech-to-text, or if your primary requirement is transcription for languages other than Chinese and basic English.

speech-to-text voice-recognition embedded-systems audio-transcription real-time-processing

Stale 6m No Package No Dependents

Maintenance 0 / 25

Adoption 10 / 25

Maturity 16 / 25

Community 20 / 25


Stars 547

License Apache-2.0

Higher-rated alternatives

PaddlePaddle/PaddleSpeech

Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with...

k2-fsa/sherpa

Speech-to-text server framework with next-gen Kaldi

Picovoice/cheetah

On-device streaming speech-to-text engine powered by deep learning

yeyupiaoling/YeAudio

Audio tools for Python

zaigie/FunSpeech

An out-of-the-box speech service for local, private deployment that quickly sets up FunASR and CosyVoice2/3 backends
