MingLunHan/CIF-PyTorch

[ICASSP 2020] CIF: Continuous Integrate-and-Fire for End-to-End Speech Recognition (a PyTorch implementation of the Continuous Integrate-and-Fire mechanism).

Emerging

This project is a component for speech recognition researchers and engineers who build models that convert audio recordings into text. It takes speech audio features as input and outputs a sequence of text units, such as words or subwords. Researchers developing or improving automatic speech recognition (ASR) systems would use it as a building block in their own models.

No commits in the last 6 months.

Use this if you are developing an end-to-end speech recognition model and need to precisely control the alignment between speech input and text output without sacrificing speed.

Not ideal if you are looking for a ready-to-use, off-the-shelf speech-to-text application for general use, rather than a component for ASR model development.
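To make the alignment idea concrete, here is a minimal pure-Python sketch of the general integrate-and-fire principle: per-frame weights are accumulated, and one output unit is emitted each time the running sum reaches a threshold. It is an illustration under stated assumptions, not the repository's code or API. The function name `cif_integrate`, the threshold of 1.0, and the assumption that every weight lies in [0, 1) are all chosen here for the example.

```python
from typing import List


def cif_integrate(
    frames: List[List[float]],
    weights: List[float],
    threshold: float = 1.0,
) -> List[List[float]]:
    """Toy integrate-and-fire (hypothetical helper, not the repo's API).

    Assumes each weight is in [0, threshold), so a frame fires at most once.
    """
    dim = len(frames[0]) if frames else 0
    outputs: List[List[float]] = []
    acc_weight = 0.0          # weight accumulated for the current output unit
    acc_state = [0.0] * dim   # weighted sum of frames for the current unit

    for frame, w in zip(frames, weights):
        if acc_weight + w < threshold:
            # Not enough weight yet: keep integrating.
            acc_weight += w
            acc_state = [s + w * x for s, x in zip(acc_state, frame)]
        else:
            # Fire: use just enough of this frame's weight to hit the threshold.
            used = threshold - acc_weight
            outputs.append([s + used * x for s, x in zip(acc_state, frame)])
            # The leftover weight starts the next output unit.
            remainder = w - used
            acc_weight = remainder
            acc_state = [remainder * x for x in frame]

    return outputs


# Four 1-dimensional frames whose weights sum to 2.0, so two units are emitted.
tokens = cif_integrate([[2.0], [4.0], [6.0], [8.0]], [0.5, 0.75, 0.25, 0.5])
print(tokens)  # [[3.0], [6.5]]
```

In this sketch the number of emitted units is governed by the sum of the weights, which is what lets a model of this kind tie the output length to the acoustic input. Any weight left over after the last frame is simply dropped here; a real implementation would need to decide how to handle that tail.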

speech-to-text audio-transcription ASR-model-development natural-language-processing machine-learning-research

Stale 6m, No Package, No Dependents

Maintenance 0 / 25

Adoption 9 / 25

Maturity 16 / 25

Community 10 / 25

Language: Python

License: Apache-2.0

Higher-rated alternatives

TensorSpeech/TensorFlowASR

:zap: TensorFlowASR: Almost State-of-the-art Automatic Speech Recognition in Tensorflow 2....

dangvansam/viet-asr

VietASR - Vietnamese Automatic Speech Recognition

wenet-e2e/wenet

Production First and Production Ready End-to-End Speech Recognition Toolkit

xinjli/allosaurus

Allosaurus is a pretrained universal phone recognizer for more than 2000 languages

srvk/eesen

The official repository of the Eesen project
