msalhab96/Listen-Attend-and-Spell
PyTorch implementation of Listen, Attend and Spell (LAS) speech recognition paper
This project helps developers working with speech recognition by providing a PyTorch implementation of the Listen, Attend and Spell (LAS) model. It takes audio files and their corresponding text transcripts as input to train a model that can convert spoken words into written text. This is for machine learning engineers or researchers building custom speech-to-text systems.
No commits in the last 6 months.
Use this if you are a machine learning engineer who needs to train a custom speech recognition model from scratch using your own audio and text data.
Not ideal if you are an end-user looking for a ready-to-use speech-to-text application or do not have experience with PyTorch and model training.
Stars
12
Forks
1
Language
Python
License
—
Category
Last pushed
Mar 04, 2022
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/msalhab96/Listen-Attend-and-Spell"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
khanld/chunkformer
ChunkFormer: Masked Chunking Conformer For Long-Form Speech Transcription
sooftware/conformer
[Unofficial] PyTorch implementation of "Conformer: Convolution-augmented Transformer for Speech...
upskyy/Squeezeformer
PyTorch implementation of "Squeezeformer: An Efficient Transformer for Automatic Speech...
WindQAQ/listen-attend-and-spell
Tensorflow implementation of "Listen, Attend and Spell" authored by William Chan. This project...
jackaduma/LAS_Mandarin_PyTorch
Listen, attend and spell Model and a Chinese Mandarin Pretrained model (中文-普通话 ASR模型)