biyoml/End-to-End-Mandarin-ASR
End-to-end speech recognition on AISHELL dataset.
This project offers an end-to-end solution for converting spoken Mandarin audio into written Chinese text. It takes audio recordings in Mandarin as input and produces their corresponding text transcripts. This would be used by researchers, developers, or linguists working on speech recognition systems for Mandarin.
No commits in the last 6 months.
Use this if you need a baseline or a starting point for building an automatic speech recognition (ASR) system specifically for Mandarin Chinese, using the AISHELL dataset.
Not ideal if you're looking for a pre-trained, ready-to-use speech-to-text API for a commercial application or a general-purpose Mandarin ASR solution without customization.
Stars
34
Forks
8
Language
Python
License
—
Category
Last pushed
Nov 09, 2021
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/biyoml/End-to-End-Mandarin-ASR"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
TensorSpeech/TensorFlowASR
:zap: TensorFlowASR: Almost State-of-the-art Automatic Speech Recognition in Tensorflow 2....
dangvansam/viet-asr
VietASR - Vietnamese Automatic Speech Recognition
wenet-e2e/wenet
Production First and Production Ready End-to-End Speech Recognition Toolkit
xinjli/allosaurus
Allosaurus is a pretrained universal phone recognizer for more than 2000 languages
srvk/eesen
The official repository of the Eesen project