biyoml/End-to-End-Mandarin-ASR

End-to-end speech recognition on AISHELL dataset.

/ 100

Emerging

This project offers an end-to-end solution for converting spoken Mandarin audio into written Chinese text. It takes audio recordings in Mandarin as input and produces their corresponding text transcripts. This would be used by researchers, developers, or linguists working on speech recognition systems for Mandarin.

No commits in the last 6 months.

Use this if you need a baseline or a starting point for building an automatic speech recognition (ASR) system specifically for Mandarin Chinese, using the AISHELL dataset.

Not ideal if you're looking for a pre-trained, ready-to-use speech-to-text API for a commercial application or a general-purpose Mandarin ASR solution without customization.

Mandarin speech recognition ASR research linguistic AI Chinese language processing

No License Stale 6m No Package No Dependents

Maintenance 0 / 25

Adoption 7 / 25

Maturity 8 / 25

Community 17 / 25

How are scores calculated?

Stars

Forks

Language

Python

License

—

Higher-rated alternatives

TensorSpeech/TensorFlowASR

:zap: TensorFlowASR: Almost State-of-the-art Automatic Speech Recognition in Tensorflow 2....

dangvansam/viet-asr

VietASR - Vietnamese Automatic Speech Recognition

wenet-e2e/wenet

Production First and Production Ready End-to-End Speech Recognition Toolkit

xinjli/allosaurus

Allosaurus is a pretrained universal phone recognizer for more than 2000 languages

srvk/eesen

The official repository of the Eesen project

Explore Voice AI Tools

All categories Trending Voice AI directory Insights