kaituoxu/Listen-Attend-Spell
A PyTorch implementation of Listen, Attend and Spell (LAS), an End-to-End ASR framework.
This project helps machine learning engineers and researchers build custom automatic speech recognition (ASR) systems. The model takes acoustic features extracted from audio and maps them directly to sequences of characters. Training produces a model that can transcribe spoken language.
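To make the "attend" part of the description above concrete, here is a minimal sketch of a single attention step in plain Python. It is not taken from this repository: the function name `attend` and the use of an unprojected dot product as the score are assumptions made for illustration, and a real LAS model would use learned layers and tensors rather than Python lists.

```python
import math

def attend(decoder_state, listener_outputs):
    """One illustrative attention step (hypothetical helper, not the repo's API).

    decoder_state: list of floats, the current speller (decoder) state.
    listener_outputs: list of equally sized float lists, one per encoded frame.
    Returns (weights, context).
    """
    # Score each encoded frame against the decoder state with a dot product.
    scores = [sum(d * h for d, h in zip(decoder_state, frame))
              for frame in listener_outputs]
    # Numerically stable softmax over the frames.
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    weights = [e / total for e in exps]
    # The context vector is the attention-weighted sum of the frames.
    dim = len(listener_outputs[0])
    context = [sum(w * frame[k] for w, frame in zip(weights, listener_outputs))
               for k in range(dim)]
    return weights, context
```

In a full model the speller would combine this context vector with its own state to predict the next character, then repeat the step for each output position.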
207 stars. No commits in the last 6 months.
Use this if you are a machine learning engineer or researcher developing custom ASR models and want an end-to-end LAS implementation in PyTorch to train yourself.
Not ideal if you are a general user looking for a ready-to-use speech-to-text application without needing to train models.
Stars: 207
Forks: 56
Language: Python
License: —
Category:
Last pushed: Jan 08, 2019
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/kaituoxu/Listen-Attend-Spell"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
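The same request can be made from Python using only the standard library. This is a sketch under assumptions: the helper names `quality_url` and `fetch_quality` are hypothetical, the response is assumed to be JSON, and the idea that other owner/repo pairs follow the same URL pattern is a guess based on the curl command above.

```python
import json
import urllib.request

# Base path copied from the curl command above.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality/voice-ai"

def quality_url(owner: str, repo: str) -> str:
    """Build the endpoint URL for a repository (URL pattern is an assumption)."""
    return f"{BASE_URL}/{owner}/{repo}"

def fetch_quality(owner: str, repo: str, timeout: float = 10.0):
    """Send a keyless GET request and parse the body, assuming it is JSON."""
    with urllib.request.urlopen(quality_url(owner, repo), timeout=timeout) as resp:
        return json.loads(resp.read().decode("utf-8"))

# Example call (performs a network request):
# data = fetch_quality("kaituoxu", "Listen-Attend-Spell")
```

Keyless access is limited to 100 requests per day, so callers that poll many repositories may want to cache responses.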
Higher-rated alternatives
khanld/chunkformer
ChunkFormer: Masked Chunking Conformer For Long-Form Speech Transcription
sooftware/conformer
[Unofficial] PyTorch implementation of "Conformer: Convolution-augmented Transformer for Speech...
upskyy/Squeezeformer
PyTorch implementation of "Squeezeformer: An Efficient Transformer for Automatic Speech...
WindQAQ/listen-attend-and-spell
TensorFlow implementation of "Listen, Attend and Spell" authored by William Chan. This project...
jackaduma/LAS_Mandarin_PyTorch
Listen, Attend and Spell model and a pretrained Mandarin Chinese model (Mandarin Chinese ASR model)