upskyy/Squeezeformer
PyTorch implementation of "Squeezeformer: An Efficient Transformer for Automatic Speech Recognition" (NeurIPS 2022)
This project provides an optimized way to build automatic speech recognition (ASR) systems. It takes raw audio data or audio features as input and outputs text transcriptions more efficiently than previous methods. This is for machine learning engineers or researchers who are building or improving speech-to-text models and need faster processing for long audio sequences.
148 stars. No commits in the last 6 months. Available on PyPI.
Use this if you are developing an automatic speech recognition system and need to process long audio inputs more efficiently.
Not ideal if you are a non-technical end-user looking for a ready-to-use speech-to-text application.
Stars
148
Forks
16
Language
Python
License
Apache-2.0
Category
Last pushed
Nov 22, 2022
Commits (30d)
0
Dependencies
2
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/upskyy/Squeezeformer"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
khanld/chunkformer
ChunkFormer: Masked Chunking Conformer For Long-Form Speech Transcription
sooftware/conformer
[Unofficial] PyTorch implementation of "Conformer: Convolution-augmented Transformer for Speech...
WindQAQ/listen-attend-and-spell
Tensorflow implementation of "Listen, Attend and Spell" authored by William Chan. This project...
jackaduma/LAS_Mandarin_PyTorch
Listen, attend and spell Model and a Chinese Mandarin Pretrained model (中文-普通话 ASR模型)
kaituoxu/Listen-Attend-Spell
A PyTorch implementation of Listen, Attend and Spell (LAS), an End-to-End ASR framework.