athena-team/athena

an open-source implementation of sequence-to-sequence based speech processing engine

/ 100

Established

This open-source project provides tools for working with spoken language. It takes audio recordings or text as input and can convert speech to text, generate speech from text, detect voice activity, or spot keywords. It's designed for researchers and developers building speech-enabled applications or services.

970 stars. No commits in the last 6 months.

Use this if you need to build or research advanced speech processing capabilities like automatic speech recognition, text-to-speech, or voice activity detection.

Not ideal if you need an out-of-the-box, user-friendly application for everyday tasks that don't require custom model development or integration.

speech-to-text text-to-speech voice-detection keyword-spotting spoken-language-processing

Stale 6m No Package No Dependents

Maintenance 0 / 25

Adoption 10 / 25

Maturity 16 / 25

Community 25 / 25

How are scores calculated?

Stars

970

Forks

199

Language

C++

License

Apache-2.0

Related tools

githubharald/CTCDecoder

Connectionist Temporal Classification (CTC) decoding algorithms: best path, beam search, lexicon...

githubharald/CTCWordBeamSearch

Connectionist Temporal Classification (CTC) decoder with dictionary and language model.

nl8590687/ASRT_SpeechRecognition

A Deep-Learning-Based Chinese Speech Recognition System 基于深度学习的中文语音识别系统

hirofumi0810/tensorflow_end2end_speech_recognition

End-to-End speech recognition implementation base on TensorFlow (CTC, Attention, and MTL training)

robmsmt/KerasDeepSpeech

A Keras CTC implementation of Baidu's DeepSpeech for model experimentation

Explore Voice AI Tools

All categories Trending Voice AI directory Insights