PaddleSpeech and RapidASR

RapidASR is a lightweight inference wrapper that runs FunASR models through ONNX Runtime on multiple platforms. It complements PaddleSpeech rather than competing with it: PaddleSpeech is a full speech toolkit, while RapidASR focuses on simplifying ASR deployment.

PaddleSpeech (Verified)

Scores: Maintenance 16/25, Adoption 10/25, Maturity 25/25, Community 23/25

Stars: 12,556

Forks: 1,956

Downloads: n/a

Commits (30d): 3

Language: Python

License: Apache-2.0

Risk flags: none

RapidASR (Emerging)

Scores: Maintenance 0/25, Adoption 10/25, Maturity 16/25, Community 19/25

Stars: 602

Forks: 70

Downloads: n/a

Commits (30d): 0

Language: C++

License: MIT

Risk flags: Stale 6m, No Package, No Dependents
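As a rough way to compare the two projects at a glance, the four category scores listed above can be added into a single total out of 100. The sketch below assumes equal weighting of the categories; that weighting, along with the names `scores` and `total_score`, is an illustrative assumption and not the site's documented scoring method.

```python
# Category scores copied from the comparison above (each is out of 25).
scores = {
    "PaddleSpeech": {"Maintenance": 16, "Adoption": 10, "Maturity": 25, "Community": 23},
    "RapidASR": {"Maintenance": 0, "Adoption": 10, "Maturity": 16, "Community": 19},
}


def total_score(categories):
    """Sum the category scores.

    Assumes every category carries equal weight; this is an assumption
    made for illustration, not the site's published formula.
    """
    return sum(categories.values())


totals = {name: total_score(cats) for name, cats in scores.items()}

for name, total in totals.items():
    print(f"{name}: {total}/100")
```

Under this assumption the gap between the projects comes mostly from Maintenance (16 versus 0) and Maturity (25 versus 16), while Adoption is identical for both.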

About PaddleSpeech

PaddlePaddle/PaddleSpeech

Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.

This toolkit helps you work with spoken language, allowing you to convert audio into written text, translate spoken English to Chinese, and generate natural-sounding speech from written text. It takes audio files or text as input and produces transcribed text, translated text, or synthetic speech. Anyone who needs to process or create speech, such as content creators, linguists, or call center managers, would find this useful.

speech-to-text text-to-speech audio-translation voice-generation language-processing

About RapidASR

RapidAI/RapidASR

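📣 A commercial-grade, open-source automatic speech recognition library that works out of the box, supports all platforms, and recognizes mixed Chinese and English speech. A cross-platform implementation of ASR inference, based on ONNXRuntime and FunASR. It provides a simpler set of APIs for calling ASR models.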

This tool converts spoken audio, including mixed Chinese and English, into written text with punctuation. You provide audio recordings, and it delivers accurate text transcripts. It's designed for anyone needing to quickly and reliably transcribe spoken words, such as content creators, researchers, or customer service professionals.

audio-transcription speech-to-text content-creation multilingual-communication data-entry-automation

Scores updated daily from GitHub, PyPI, and npm data.