MML-Group/code4AVE-Speech

Source Code for AVE Speech Dataset

Experimental

This project provides the source code for AVE Speech, a Mandarin speech corpus with synchronized audio, lip video, and surface electromyography (EMG) signals. The corpus contains 55+ hours of multi-modal data from 100 native speakers, which researchers can use to train and evaluate speech recognition models on any of these modalities.

No commits in the last 6 months.

Use this if you are a speech recognition researcher looking for a large-scale, multi-modal Mandarin speech dataset for training and evaluating models.

Not ideal if you are looking for a pre-built, ready-to-deploy speech recognition application rather than a dataset for research and development.

speech-recognition computational-linguistics bio-signal-processing human-computer-interaction AI-research

Stale 6m, No Package, No Dependents

Maintenance 2 / 25

Adoption 5 / 25

Maturity 15 / 25

Community 0 / 25

Language: Python

License: MIT

Featured in

Things AI Won't Tell You About Building a Voice App

Choosing a Voice AI Library in 2026: What's Actually Worth Building On

Higher-rated alternatives

Uberi/speech_recognition

Speech recognition module for Python, supporting several engines and APIs, online and offline.

cmusphinx/pocketsphinx

A small speech recognizer

tensorflow/lingvo

Lingvo

modelscope/FunASR

A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models,...

PyThaiNLP/pythaiasr

Python Thai Automatic Speech Recognition
