Quantatirsk/funasr-api

Speech recognition API service powered by FunASR and Qwen3-ASR, supporting 52 languages and compatible with the OpenAI API and the Alibaba Cloud Speech API.

46 / 100 (Emerging)

This project provides a local service for transcribing spoken audio into text in 52 languages. You feed it an audio file or a live audio stream, and it returns a written transcript, optionally identifying different speakers and providing word-level timestamps. It is designed for developers who need to integrate speech-to-text into their applications without relying on external cloud services.
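Since the service is described as compatible with the OpenAI API, a client call might look roughly like the sketch below. It assumes the service exposes the OpenAI-style `/v1/audio/transcriptions` route; the host, port, and model name are hypothetical placeholders that this listing does not specify, so check the project's own documentation before relying on them.

```python
import json
import os
import urllib.request
import uuid

# Hypothetical values: the listing does not state the service's host, port, or model names.
BASE_URL = "http://localhost:8000"
# Route used by the OpenAI audio API; assumed to be mirrored by the service.
TRANSCRIPTIONS_PATH = "/v1/audio/transcriptions"


def build_multipart(fields, file_field, filename, file_bytes, boundary=None):
    """Encode text fields plus one file as multipart/form-data.

    Returns a (body_bytes, content_type_header) tuple.
    """
    boundary = boundary or uuid.uuid4().hex
    parts = []
    for name, value in fields.items():
        parts.append(
            f'--{boundary}\r\n'
            f'Content-Disposition: form-data; name="{name}"\r\n\r\n'
            f'{value}\r\n'.encode("utf-8")
        )
    parts.append(
        f'--{boundary}\r\n'
        f'Content-Disposition: form-data; name="{file_field}"; filename="{filename}"\r\n'
        f'Content-Type: application/octet-stream\r\n\r\n'.encode("utf-8")
        + file_bytes
        + b"\r\n"
    )
    parts.append(f"--{boundary}--\r\n".encode("utf-8"))
    return b"".join(parts), f"multipart/form-data; boundary={boundary}"


def transcribe(audio_path, model="placeholder-model", base_url=BASE_URL):
    """POST an audio file to an OpenAI-style transcription route and return the parsed JSON."""
    with open(audio_path, "rb") as fh:
        audio = fh.read()
    body, content_type = build_multipart(
        {"model": model}, "file", os.path.basename(audio_path), audio
    )
    request = urllib.request.Request(
        base_url + TRANSCRIPTIONS_PATH,
        data=body,
        headers={"Content-Type": content_type},
        method="POST",
    )
    with urllib.request.urlopen(request, timeout=60) as response:
        return json.load(response)
```

Calling `transcribe("meeting.wav")` would send the file to the assumed local endpoint; the shape of the returned JSON depends on the service and is not described here.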

191 stars.

Use this if you are a developer building an application that needs to convert spoken audio into text, identify multiple speakers, or process live audio streams locally.

Not ideal if you are an end-user looking for a ready-to-use desktop application or a simple web interface for transcribing audio.

Tags: audio-transcription, speech-recognition-development, real-time-audio-processing, multi-language-support
No license, no package, no dependents
Maintenance 13 / 25
Adoption 10 / 25
Maturity 5 / 25
Community 18 / 25
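The four category scores above add up to the overall score of 46 / 100. The short check below assumes the overall score is a plain sum of the categories; that is consistent with the numbers shown, but the listing does not state the actual scoring method.

```python
# Category scores as shown in the listing, each out of 25.
category_scores = {
    "Maintenance": 13,
    "Adoption": 10,
    "Maturity": 5,
    "Community": 18,
}
MAX_PER_CATEGORY = 25


def overall_score(scores):
    """Assumed aggregation: an unweighted sum of the category scores."""
    return sum(scores.values())


total = overall_score(category_scores)
maximum = MAX_PER_CATEGORY * len(category_scores)
print(f"{total} / {maximum}")  # prints "46 / 100"
```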


Stars: 191
Forks: 31
Language: Python
License: none
Last pushed: Mar 17, 2026
Commits (30d): 0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/Quantatirsk/funasr-api"

Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000 requests/day.
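The same request can be made from Python with the standard library alone. This is a minimal sketch: it assumes the endpoint returns JSON and that the path follows a category/owner/repository pattern, as the curl URL above suggests. The helper names `quality_url` and `fetch_quality` are illustrative and not part of any published client.

```python
import json
import urllib.request
from urllib.parse import quote

API_ROOT = "https://pt-edge.onrender.com/api/v1/quality"


def quality_url(category, owner, repo):
    """Build the quality endpoint URL (path pattern inferred from the curl example)."""
    segments = (quote(part, safe="") for part in (category, owner, repo))
    return API_ROOT + "/" + "/".join(segments)


def fetch_quality(category, owner, repo, timeout=30):
    """Fetch the quality record; assumes the response body is JSON."""
    url = quality_url(category, owner, repo)
    with urllib.request.urlopen(url, timeout=timeout) as response:
        return json.load(response)


if __name__ == "__main__":
    data = fetch_quality("voice-ai", "Quantatirsk", "funasr-api")
    print(json.dumps(data, indent=2))
```

No API key is sent here, so calls count against the keyless daily limit mentioned above.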