TeaPoly/Conformer-Athena

Dynamic Chunk Streaming and Offline Conformer based on athena-team/Athena.

/ 100

Emerging

This is a comprehensive toolkit for building and training advanced speech processing models. It takes raw audio files and corresponding transcripts as input and outputs trained models capable of tasks like transcribing speech, generating speech, or recognizing speakers. It's designed for machine learning engineers and researchers focused on developing and improving end-to-end speech AI systems.

No commits in the last 6 months.

Use this if you need a flexible, open-source framework to develop, train, and deploy custom models for various speech AI tasks, from automatic speech recognition to voice synthesis.

Not ideal if you're looking for a simple, out-of-the-box solution to transcribe audio without any model training or development.

speech-recognition speech-synthesis voice-conversion speaker-recognition machine-learning-engineering

Stale 6m No Package No Dependents

Maintenance 0 / 25

Adoption 8 / 25

Maturity 16 / 25

Community 16 / 25

How are scores calculated?

Stars

Forks

Language

Python

License

Apache-2.0

Higher-rated alternatives

khanld/chunkformer

ChunkFormer: Masked Chunking Conformer For Long-Form Speech Transcription

sooftware/conformer

[Unofficial] PyTorch implementation of "Conformer: Convolution-augmented Transformer for Speech...

upskyy/Squeezeformer

PyTorch implementation of "Squeezeformer: An Efficient Transformer for Automatic Speech...

WindQAQ/listen-attend-and-spell

Tensorflow implementation of "Listen, Attend and Spell" authored by William Chan. This project...

jackaduma/LAS_Mandarin_PyTorch

Listen, attend and spell Model and a Chinese Mandarin Pretrained model (中文-普通话 ASR模型)

Explore Voice AI Tools

All categories Trending Voice AI directory Insights