FireRedTeam/FireRedASR
Open-source industrial-grade ASR models supporting Mandarin, Chinese dialects and English, achieving a new SOTA on public Mandarin ASR benchmarks, while also offering outstanding singing lyrics recognition capability.
This tool helps you accurately convert spoken audio into written text, handling Mandarin, Chinese dialects, and English. You input audio files, and it outputs precise text transcripts, even recognizing singing lyrics. It's designed for professionals in media, call centers, or content creation who need reliable transcription.
1,796 stars.
Use this if you need highly accurate, industrial-grade transcription for Mandarin, Chinese dialects, or English audio, including singing.
Not ideal if your audio inputs are consistently longer than 30-60 seconds, as this can lead to transcription issues or errors.
Stars
1,796
Forks
159
Language
Python
License
Apache-2.0
Category
Last pushed
Feb 25, 2026
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/FireRedTeam/FireRedASR"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Compare
Related tools
meizhong986/WhisperJAV
ASR/STT subtitle generator. Uses Qwen3-ASR, local LLM, Whisper, TEN-VAD. Noise-robust for JAV
itsmevictor/clean-transcribe
A simple CLI to transcribe Youtube videos or local audio/video files and produce LLM-cleaned...
vivekuppal/transcribe
Transcribe is a real time transcription, conversation, Language learning platform. It provides...
BryceWG/BiBi-Keyboard
说点啥(BiBi Keyboard):一个基于 Kotlin 的 Android 平台的 LLM 与 ASR 语音输入法键盘应用 An LLM ASR voice input method...
sindresorhus/awesome-whisper
🔊 Awesome list for Whisper — an open-source AI-powered speech recognition system developed by OpenAI