PaddleSpeech and RapidASR
RapidASR is a lightweight inference wrapper built on FunASR models and ONNX Runtime. It complements PaddleSpeech rather than competing with it: PaddleSpeech is a full speech toolkit covering ASR, TTS, speech translation, and more, while RapidASR focuses on simplifying cross-platform deployment of ASR inference.
About PaddleSpeech
PaddlePaddle/PaddleSpeech
Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.
This toolkit helps you work with spoken language, allowing you to convert audio into written text, translate spoken English to Chinese, and generate natural-sounding speech from written text. It takes audio files or text as input and produces transcribed text, translated text, or synthetic speech. Anyone who needs to process or create speech, such as content creators, linguists, or call center managers, would find this useful.
About RapidASR
RapidAI/RapidASR
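📣 A commercial-grade, open-source automatic speech recognition library that works out of the box, supports all platforms, and recognizes mixed Chinese and English speech. A cross-platform implementation of ASR inference, based on ONNXRuntime and FunASR. We provide a set of easier APIs to call ASR models.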
This tool converts spoken audio, including mixed Chinese and English, into written text with punctuation. You provide audio recordings, and it delivers accurate text transcripts. It's designed for anyone needing to quickly and reliably transcribe spoken words, such as content creators, researchers, or customer service professionals.