upskyy/Squeezeformer

PyTorch implementation of "Squeezeformer: An Efficient Transformer for Automatic Speech Recognition" (NeurIPS 2022)

48
/ 100
Emerging

This project provides an optimized way to build automatic speech recognition (ASR) systems. It takes raw audio data or audio features as input and outputs text transcriptions more efficiently than previous methods. This is for machine learning engineers or researchers who are building or improving speech-to-text models and need faster processing for long audio sequences.

148 stars. No commits in the last 6 months. Available on PyPI.

Use this if you are developing an automatic speech recognition system and need to process long audio inputs more efficiently.

Not ideal if you are a non-technical end-user looking for a ready-to-use speech-to-text application.

automatic-speech-recognition speech-to-text natural-language-processing audio-transcription machine-learning-engineering
Stale 6m
Maintenance 0 / 25
Adoption 10 / 25
Maturity 25 / 25
Community 13 / 25

How are scores calculated?

Stars

148

Forks

16

Language

Python

License

Apache-2.0

Last pushed

Nov 22, 2022

Commits (30d)

0

Dependencies

2

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/upskyy/Squeezeformer"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.