FunASR Speech Recognition Voice AI Tools

Speech recognition APIs and clients built on or wrapping FunASR and similar open-source ASR frameworks. Includes deployment servers, language bindings, and integration layers. Does NOT include text-to-speech, voice assistants, or end-user applications using ASR as a component.

53 FunASR speech recognition tools are tracked. One scores above 70 (Verified tier). The highest-rated is PaddlePaddle/PaddleSpeech at 74/100 with 12,556 stars. 3 of the top 10 are actively maintained.

Get all 53 projects as JSON

curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=voice-ai&subcategory=funasr-speech-recognition&limit=20"

Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000/day.
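The same request can be made from Python. The sketch below uses only the standard library and reuses the endpoint and query parameters from the curl command above. The function names `build_quality_url` and `fetch_quality` are illustrative, not part of any published client. The shape of the JSON response is not documented here, so the code returns the parsed payload as-is and does not assume any field names. Whether a `limit` higher than 20 is accepted is also an assumption to check against the API.

```python
import json
from urllib.parse import urlencode
from urllib.request import urlopen

# Endpoint taken from the curl command above.
BASE_URL = "https://pt-edge.onrender.com/api/v1/datasets/quality"


def build_quality_url(domain="voice-ai",
                      subcategory="funasr-speech-recognition",
                      limit=20):
    """Build the query URL; defaults mirror the curl example."""
    query = urlencode({
        "domain": domain,
        "subcategory": subcategory,
        "limit": limit,
    })
    return f"{BASE_URL}?{query}"


def fetch_quality(url, timeout=30):
    """Fetch the URL and return the decoded JSON payload unchanged.

    No response schema is assumed; inspect the payload before relying
    on specific keys.
    """
    with urlopen(url, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))


# Usage (performs a network request, counted against the daily quota):
#   payload = fetch_quality(build_quality_url())
#   print(json.dumps(payload, indent=2, ensure_ascii=False))
```

Keeping URL construction separate from the network call makes it easy to check the query string without spending a request.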

Rank Tool Score Tier
1 PaddlePaddle/PaddleSpeech

Easy-to-use Speech Toolkit including Self-Supervised Learning model,...

74
Verified
2 k2-fsa/sherpa

Speech-to-text server framework with next-gen Kaldi

69
Established
3 Picovoice/cheetah

On-device streaming speech-to-text engine powered by deep learning

68
Established
4 yeyupiaoling/YeAudio

Audio tools for Python

52
Established
5 zaigie/FunSpeech

Out-of-the-box speech service for local private deployment; quickly sets up FunASR and CosyVoice2/3 backends

52
Established
6 manyeyes/ManySpeech

AI Speech Solutions for Tasks such as ASR, Vocal Extraction, Accompaniment...

52
Established
7 Picovoice/leopard

On-device speech-to-text engine powered by deep learning

49
Emerging
8 sipeed/Maix-Speech

Maix Speech AI lib, a fast and small speech lib running on embedded devices,...

46
Emerging
9 cvqluu/simple_diarizer

Simplified diarization pipeline using some pretrained models - audio file to...

46
Emerging
10 chenkui164/FastASR

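A C++ implementation of ASR inference with few dependencies, simple installation, and fast inference; it also runs smoothly on ARM platforms such as the Raspberry Pi 4B....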

46
Emerging
11 Quantatirsk/funasr-api

Speech recognition API service powered by FunASR and Qwen-ASR, supporting 52...

46
Emerging
12 lukeewin/FunASR_API

Speech recognition API with speaker diarization, built on FunASR | This is a speaker-diarization-based speech...

46
Emerging
13 atomiechen/FunASR-Client

Really easy-to-use Python client for FunASR runtime server.

45
Emerging
14 RapidAI/RapidASR

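📣 Commercial-grade open-source automatic speech recognition library: works out of the box, supports all platforms, and recognizes mixed Chinese-English speech. A Cross-platform implementation of ASR...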

45
Emerging
15 jianchang512/fireredasr-ui

A Chinese speech-to-text project wrapping FireRedASR

41
Emerging
16 zhangzijie-pro/Speaker-Verification

Dual-model speech AI toolkit for speaker verification and speaker-aware...

41
Emerging
17 bgArray/ZhiYin

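ZhiYin (知音): integrated software for AI audio listening features. Provides multiple tools such as vocal technique recognition and analysis and accompaniment separation.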

40
Emerging
18 PhuocElec/zipformer-asr-api

REST-API implementation of ZipFormer for automatic speech recognition (ASR)...

40
Emerging
19 xhuvom/omnilingual-ASR-Web-Dashboard

Meta Omnilingual ASR web based dashboard for testing and API based...

38
Emerging
20 kroko-ai/kroko-onnx

Kroko ASR - Speech-to-text

38
Emerging
21 tsengia/JSGFKit_Plus_Plus

A C++ library for parsing and manipulating JSGF grammar files.

36
Emerging
22 qkl9527/voice-assistant

FunASR-based [real-time] AI voice assistant

35
Emerging
23 Ikaros-521/FunASR_WS

WebSocket server modified from the official FunASR demo, paired with FastAPI to provide an HTTP service; supports real-time ASR testing in the browser

35
Emerging
24 jaganadhg/nemoexamples

Experiments with NVIDIA NeMo

35
Emerging
25 taeyoun811/Whisfusion

Whisfusion: Parallel ASR Decoding via a Diffusion Transformer

34
Emerging
26 huakunyang/SummerAsr

SummerAsr is a C++-based local Chinese speech recognizer that compiles standalone and has almost no extra library dependencies. Summer Asr is a Chinese...

31
Emerging
27 binglel/asr_baidu_web_server

asr web server based on flask

31
Emerging
28 SzLeaves/asr-webapp

ASR Web APP: a Chinese speech recognition lab app built with Django, including Chinese speech-to-text and Chinese voice chatbot modules

30
Emerging
29 Anwarvic/Web-Interface-for-NVIDIA-NeMo

This repository contains an attempt to utilize the NeMo toolkit created by NVIDIA

29
Experimental
30 wq2012/VB_diarization

VB Diarization with Eigenvoice and HMM Priors, refactored

28
Experimental
31 Kaljurand/Grammars

Grammatical Framework based speech recognition grammars for Estonian,...

28
Experimental
32 terry-yip/speech-to-text

Speaker diarization and speech to text

27
Experimental
33 ArenAcikgoz/Whisper-Alignment

Forced alignment decoder for Whisper.

27
Experimental
34 vahnxu/doubao-asr

Agent Skill: Transcribe audio files via ByteDance Volcengine Seed-ASR 2.0...

27
Experimental
35 HsiangNianian/funasr-api

FunASR API is a FastAPI-based inference gateway that wraps multiple FunASR...

26
Experimental
36 atomiechen/funasr-client-ts

Really easy-to-use Typescript client for FunASR runtime server.

24
Experimental
37 yuhanwang14/ASR-Pipeline

Local GPU-accelerated speech transcription pipeline with speaker diarization...

23
Experimental
38 SunPCSolutions/DiarASR

Enterprise-Grade Secure ASR Diarization Pipeline - HIPAA-compliant speech...

22
Experimental
39 aidayang/FunASR-OneClick

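Real-time FunASR speech recognition edition that recognizes microphone input and audio played on the computer; voice typing software for PCs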

21
Experimental
40 adamelkholyy/hpc-nemo

Fork for running Whisper transcriptions with Nemo diarization on University...

20
Experimental
41 adityajn105/google_speech_diarization_demo

A demo to show Speech Diarization (separating audio of different speakers)...

19
Experimental
42 moziarnj07-sys/doubaoime-asr

🎤 Enable voice recognition for the Doubao input method using Python; ideal...

19
Experimental
43 jaycollett/hass_nemo

Simple Python Docker exposing an API using Nemo to perform text...

18
Experimental
44 kaka-lin/multi-asr-toolkit

A flexible speech recognition toolkit supporting multiple backends...

18
Experimental
45 aaaastark/NeMo-WeightsBiases-TTS

Training and Tuning a Text-to-Speech model with NVIDIA NeMo and Weights and Biases

17
Experimental
46 DDDeeeee/Teasr

Microphone-free speech recognition and text polishing for vibe coding.

17
Experimental
47 EthanLifeGreat/AudioPsyChat

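A web-based voice psychological counseling system that runs locally on a server. Its core uses [PsyChat]; we built a web front end for it and attached ASR and TTS components so that users on the local network...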

12
Experimental
48 scionoftech/speaker_diarization

speaker diarization using spectralcluster and Deeplearning

12
Experimental
49 lissettecarlr/AutomaticSpeechRecognition

Python wrappers for various speech-to-text backends (paraformer, whisper_online, whisper_offline, funasr), built to serve the kuon repository

12
Experimental
50 D-Keqi/Implementation-for-ASR-by-API-of-Baidu

This is an open source code that you can use to connect to Baidu's API to...

11
Experimental
51 Nanfengzhiwo1/XunFeiASR

This is an Automatic Speech Recognition project

11
Experimental
52 kensonhui/Speaker-Diarization-Sentiment-Analysis

This project performs speech recognition and diarization (speaker...

10
Experimental
53 fabianbusch/zamia-asr-dockerization

Dockerized HTTP-Abstraction of Zamia-ASR Scripts for easy-to-use transcription.

10
Experimental
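
The tier labels in the ranking above appear to follow score bands. The page states only that scores above 70 fall in the Verified tier. The other cut-offs in this sketch are inferred from the listed rows (52 is Established, 49 and 30 are Emerging, 29 is Experimental), so the exact boundaries at 70, 50, and 30 are assumptions rather than a published rule. `TIER_FLOORS` and `tier_for` are hypothetical names used for illustration.

```python
# Assumed lower bounds for each tier, inferred from the ranking above.
# Only "above 70 = Verified" is stated on the page; the rest is a guess
# that happens to match every listed row.
TIER_FLOORS = [
    (70, "Verified"),
    (50, "Established"),
    (30, "Emerging"),
    (0, "Experimental"),
]


def tier_for(score):
    """Return the tier whose assumed floor is the highest one <= score."""
    if not 0 <= score <= 100:
        raise ValueError(f"score out of range: {score}")
    for floor, tier in TIER_FLOORS:
        if score >= floor:
            return tier
    raise AssertionError("unreachable: floor 0 catches all valid scores")


# Spot checks against rows from the ranking.
for sample in (74, 69, 52, 49, 30, 29, 10):
    print(sample, tier_for(sample))
```

A mapping like this is handy for filtering the JSON export by tier on the client side, provided the assumed boundaries are checked against the service's own labels first.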