upskyy/Squeezeformer

PyTorch implementation of "Squeezeformer: An Efficient Transformer for Automatic Speech Recognition" (NeurIPS 2022)

/ 100

Emerging

This project provides a PyTorch implementation of Squeezeformer, an efficient Transformer architecture for automatic speech recognition (ASR). It takes raw audio data or audio features as input and outputs text transcriptions. It is aimed at machine learning engineers and researchers who are building or improving speech-to-text models and need faster processing of long audio sequences.

148 stars. No commits in the last 6 months. Available on PyPI.

Use this if you are developing an automatic speech recognition system and need to process long audio inputs more efficiently.

Not ideal if you are a non-technical end-user looking for a ready-to-use speech-to-text application.

automatic-speech-recognition speech-to-text natural-language-processing audio-transcription machine-learning-engineering

Maintenance 0 / 25

Adoption 10 / 25

Maturity 25 / 25

Community 13 / 25

Stars

148

Forks

Language

Python

License

Apache-2.0

Higher-rated alternatives

khanld/chunkformer

ChunkFormer: Masked Chunking Conformer For Long-Form Speech Transcription

sooftware/conformer

[Unofficial] PyTorch implementation of "Conformer: Convolution-augmented Transformer for Speech...

WindQAQ/listen-attend-and-spell

Tensorflow implementation of "Listen, Attend and Spell" authored by William Chan. This project...

jackaduma/LAS_Mandarin_PyTorch

Listen, attend and spell Model and a Chinese Mandarin Pretrained model (中文-普通话 ASR模型)

kaituoxu/Listen-Attend-Spell

A PyTorch implementation of Listen, Attend and Spell (LAS), an End-to-End ASR framework.
