srvk/eesen

The official repository of the Eesen project

51 / 100
Established

This project helps speech scientists and researchers build end-to-end Automatic Speech Recognition (ASR) systems that take raw speech audio as input and directly output transcribed text. It is designed for those who want to simplify the traditional ASR pipeline by replacing complex components such as Hidden Markov Models and Gaussian Mixture Models with recurrent neural networks.

834 stars. No commits in the last 6 months.

Use this if you are developing ASR systems and want a streamlined, deep learning-based approach to convert speech audio into text.

Not ideal if you are looking for a pre-trained, off-the-shelf speech-to-text API or a tool for general audio processing.

speech-recognition acoustic-modeling natural-language-processing deep-learning-research
Stale 6m, No Package, No Dependents
Maintenance 0 / 25
Adoption 10 / 25
Maturity 16 / 25
Community 25 / 25


Stars: 834
Forks: 339
Language: C++
License: Apache-2.0
Last pushed: May 23, 2019
Commits (30d): 0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/srvk/eesen"

Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000 requests/day.
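
For scripted use, the same request as the curl command can be made from Python's standard library. This is a minimal sketch, not an official client. It assumes the endpoint returns JSON, which the listing does not confirm, and the parameter names `category`, `owner`, and `repo` are only one reading of the URL path segments `voice-ai/srvk/eesen`. How an API key would be supplied is not documented here, so the sketch makes unauthenticated requests only.

```python
import json
import urllib.request
from urllib.parse import quote

# Base path copied from the curl example in the listing.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def quality_url(category: str, owner: str, repo: str) -> str:
    """Build the endpoint URL in the same shape as the curl example.

    The parameter names are hypothetical labels for the three path segments.
    """
    segments = (category, owner, repo)
    return BASE_URL + "/" + "/".join(quote(s, safe="") for s in segments)


def fetch_quality(category: str, owner: str, repo: str, timeout: float = 10.0):
    """Fetch the quality record for one repository.

    Assumption: the response body is JSON. If it is not, json.loads raises
    a ValueError and the caller should inspect the raw body instead.
    """
    url = quality_url(category, owner, repo)
    with urllib.request.urlopen(url, timeout=timeout) as resp:
        return json.loads(resp.read().decode("utf-8"))
```

Calling `fetch_quality("voice-ai", "srvk", "eesen")` issues the same GET request as the curl command. Since unauthenticated access is limited to 100 requests/day, a script that polls many repositories may want to cache results locally.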