TeaPoly/Conformer-Athena
Dynamic Chunk Streaming and Offline Conformer based on athena-team/Athena.
This is a comprehensive toolkit for building and training advanced speech processing models. It takes raw audio files and corresponding transcripts as input and outputs trained models capable of tasks like transcribing speech, generating speech, or recognizing speakers. It's designed for machine learning engineers and researchers focused on developing and improving end-to-end speech AI systems.
No commits in the last 6 months.
Use this if you need a flexible, open-source framework to develop, train, and deploy custom models for various speech AI tasks, from automatic speech recognition to voice synthesis.
Not ideal if you're looking for a simple, out-of-the-box solution to transcribe audio without any model training or development.
Stars: 44
Forks: 8
Language: Python
License: Apache-2.0
Category:
Last pushed: Nov 02, 2022
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/TeaPoly/Conformer-Athena"
Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000 requests/day.
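The curl request above can also be issued from Python. The following is a minimal sketch using only the standard library. The helper names build_quality_url and fetch_quality are hypothetical, and the assumption that the endpoint returns a JSON body is not confirmed by this page, so treat the parsing step as illustrative.

```python
import json
import urllib.request
from urllib.parse import quote

# Base path copied from the curl command shown on this page.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def build_quality_url(category: str, owner: str, repo: str) -> str:
    """Build the endpoint URL in the same shape as the curl example."""
    # Percent-encode each path segment so unusual characters stay valid.
    parts = [quote(part, safe="") for part in (category, owner, repo)]
    return BASE_URL + "/" + "/".join(parts)


def fetch_quality(category: str, owner: str, repo: str, timeout: float = 10.0):
    """Fetch the record for one repository.

    Assumption: the response body is JSON. The page does not document
    the response format, so adjust the parsing if it differs.
    """
    url = build_quality_url(category, owner, repo)
    with urllib.request.urlopen(url, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))


if __name__ == "__main__":
    # Prints the URL only; call fetch_quality(...) to make the request.
    print(build_quality_url("voice-ai", "TeaPoly", "Conformer-Athena"))
```

Calling fetch_quality("voice-ai", "TeaPoly", "Conformer-Athena") performs the actual request and counts against the daily limit described above.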
Higher-rated alternatives
khanld/chunkformer
ChunkFormer: Masked Chunking Conformer For Long-Form Speech Transcription
sooftware/conformer
[Unofficial] PyTorch implementation of "Conformer: Convolution-augmented Transformer for Speech...
upskyy/Squeezeformer
PyTorch implementation of "Squeezeformer: An Efficient Transformer for Automatic Speech...
WindQAQ/listen-attend-and-spell
Tensorflow implementation of "Listen, Attend and Spell" authored by William Chan. This project...
jackaduma/LAS_Mandarin_PyTorch
Listen, attend and spell Model and a Chinese Mandarin Pretrained model (中文-普通话 ASR模型)