RapidAI/RapidASR
📣 商用级开源语音自动识别程序库,开箱即用,全平台支持,中英文混合识别。A Cross-platform implementation of ASR inference. It's based on ONNXRuntime and FunASR. We provide a set of easier APIs to call ASR models.
This tool converts spoken audio, including mixed Chinese and English, into written text with punctuation. You provide audio recordings, and it delivers accurate text transcripts. It's designed for anyone needing to quickly and reliably transcribe spoken words, such as content creators, researchers, or customer service professionals.
602 stars. No commits in the last 6 months.
Use this if you need an out-of-the-box solution for transcribing audio files containing both Chinese and English speech into text, suitable for various operating systems.
Not ideal if you need a solution for languages other than Chinese and English, or if you require highly specialized domain-specific transcription that might need custom model training.
Stars
602
Forks
70
Language
C++
License
MIT
Category
Last pushed
May 15, 2024
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/RapidAI/RapidASR"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Compare
Higher-rated alternatives
PaddlePaddle/PaddleSpeech
Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with...
k2-fsa/sherpa
Speech-to-text server framework with next-gen Kaldi
Picovoice/cheetah
On-device streaming speech-to-text engine powered by deep learning
yeyupiaoling/YeAudio
Python的音频工具
zaigie/FunSpeech
开箱即用的本地私有化部署语音服务,快速搭建FunASR与CosyVoice2/3后端