FunAudioLLM/Fun-ASR

Fun-ASR is an end-to-end speech recognition large model launched by Tongyi Lab.

/ 100

Established

This project helps convert spoken words into accurate written text, even in noisy environments or when people speak different languages or dialects. You feed it audio recordings, and it produces a precise transcription. Anyone needing to document conversations, analyze speech, or create captions from audio, such as educators, financial analysts, or content creators, would find this tool useful.

946 stars.

Use this if you need highly accurate, real-time transcriptions of audio, especially for recordings with background noise, multiple languages/dialects, or specialized industry terminology.

Not ideal if your primary need is for speaker identification and separation, as this feature is still under development.

speech-to-text audio-transcription language-processing multilingual-communication education-tech

No Package No Dependents

Maintenance 10 / 25

Adoption 10 / 25

Maturity 13 / 25

Community 17 / 25

How are scores calculated?

Stars

946

Forks

Language

Python

License

Apache-2.0

Featured in

Things AI Won't Tell You About Building a Voice App Choosing a Voice AI Library in 2026: What's Actually Worth Building On

Compare

Fun-ASR and FunASR

Related tools

Uberi/speech_recognition

Speech recognition module for Python, supporting several engines and APIs, online and offline.

cmusphinx/pocketsphinx

A small speech recognizer

tensorflow/lingvo

Lingvo

modelscope/FunASR

A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models,...

PyThaiNLP/pythaiasr

Python Thai Automatic Speech Recognition

Explore Voice AI Tools

All categories Trending Voice AI directory Insights