End-to-End ASR Frameworks Voice AI Tools

PyTorch-based implementations of complete automatic speech recognition systems with integrated acoustic modeling, feature extraction, and decoding. Does NOT include ASR evaluation metrics, language models, individual components (vocoder, G2P), or non-PyTorch frameworks like Kaldi-only solutions.

There are 117 end-to-end asr frameworks tools tracked. 7 score above 50 (established tier). The highest-rated is TensorSpeech/TensorFlowASR at 62/100 with 1,005 stars.

Get all 117 projects as JSON

curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=voice-ai&subcategory=end-to-end-asr-frameworks&limit=20"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.

# Tool Score Tier
1 TensorSpeech/TensorFlowASR

:zap: TensorFlowASR: Almost State-of-the-art Automatic Speech Recognition in...

62
Established
2 dangvansam/viet-asr

VietASR - Vietnamese Automatic Speech Recognition

61
Established
3 wenet-e2e/wenet

Production First and Production Ready End-to-End Speech Recognition Toolkit

57
Established
4 xinjli/allosaurus

Allosaurus is a pretrained universal phone recognizer for more than 2000 languages

56
Established
5 srvk/eesen

The official repository of the Eesen project

51
Established
6 hirofumi0810/neural_sp

End-to-end ASR/LM implementation with PyTorch

51
Established
7 sooftware/kospeech

Open-Source Toolkit for End-to-End Korean Automatic Speech Recognition...

51
Established
8 Audio-WestlakeU/VINP

Official PyTorch implementation of 'VINP: Variational Bayesian Inference...

49
Emerging
9 yl4579/AuxiliaryASR

Joint CTC-S2S Phoneme-level ASR for Voice Conversion and TTS (Text-Mel Alignment)

48
Emerging
10 gentaiscool/end2end-asr-pytorch

End-to-End Automatic Speech Recognition on PyTorch

48
Emerging
11 clovaai/ClovaCall

ClovaCall dataset and Pytorch LAS baseline code (Interspeech 2020)

48
Emerging
12 openspeech-team/openspeech

Open-Source Toolkit for End-to-End Speech Recognition leveraging...

48
Emerging
13 voicekit-team/T-one

T-one is a high-performance streaming ASR pipeline for Russian, specialized...

47
Emerging
14 freewym/espresso

Espresso: A Fast End-to-End Neural Speech Recognition Toolkit

46
Emerging
15 by2101/OpenASR

A pytorch based end2end speech recognition system.

45
Emerging
16 theblackcat102/edgedict

Working online speech recognition based on RNN Transducer. ( Trained model...

45
Emerging
17 hirofumi0810/asr_preprocessing

Python implementation of pre-processing for End-to-End speech recognition

44
Emerging
18 iamjanvijay/rnnt_decoder_cuda

An efficient implementation of RNN-T Prefix Beam Search in C++/CUDA.

44
Emerging
19 ryanleary/patter

speech-to-text in pytorch

43
Emerging
20 nobody132/masr

中文语音识别; Mandarin Automatic Speech Recognition;

43
Emerging
21 George0828Zhang/torch_cif

A fast parallel PyTorch implementation of the "CIF: Continuous...

43
Emerging
22 GinoShun/Accent-Activation-Steering

Official code for "Activation Steering for Accent Adaptation in Speech...

43
Emerging
23 R1ckShi/AESRC2020

[ICASSP2021] Data preperation scripts, training pipeline and baseline...

43
Emerging
24 upskyy/Transformer-Transducer

PyTorch implementation of "Transformer Transducer: A Streamable Speech...

43
Emerging
25 kaituoxu/Speech-Transformer

A PyTorch implementation of Speech Transformer, an End-to-End ASR with...

43
Emerging
26 jinserk/pytorch-asr

ASR with PyTorch

42
Emerging
27 charlesliucn/awesome-end2end-asr

💬 A list of End-to-End speech recognition, including papers, codes and other...

42
Emerging
28 declare-lab/speech-adapters

Codes and datasets for our ICASSP2023 paper, Evaluating parameter-efficient...

41
Emerging
29 pika-online/AESRC2020

a deep accent recognition network

41
Emerging
30 tugstugi/mongolian-speech-recognition

Mongolian speech recognition with PyTorch

40
Emerging
31 awslabs/speech-representations

Code for DeCoAR (ICASSP 2020) and BERTphone (Odyssey 2020)

40
Emerging
32 zh217/torch-asg

Auto Segmentation Criterion (ASG) implemented in pytorch

40
Emerging
33 1ytic/pytorch-edit-distance

Levenshtein edit-distance on PyTorch and CUDA

40
Emerging
34 1ytic/open_stt_e2e

PyTorch end-to-end speech recognition

39
Emerging
35 oleges1/quartznet-pytorch

Quartznet implementation on pytorch [https://arxiv.org/abs/1910.10261]

39
Emerging
36 sooftware/speech-transformer

Transformer implementation speciaized in speech recognition tasks using Pytorch.

39
Emerging
37 DataXujing/ASR-paper

:fire: ASR教程: https://dataxujing.github.io/ASR-paper/

39
Emerging
38 tabahi/contexless-phonemes-CUPE

pytorch model for contexless-phoneme prediction from speech audio

39
Emerging
39 VITA-Group/Audio-Lottery

[ICLR 2022] "Audio Lottery: Speech Recognition Made Ultra-Lightweight,...

39
Emerging
40 HawkAaron/E2E-ASR

PyTorch Implementations for End-to-End Automatic Speech Recognition

38
Emerging
41 xingchensong/Speech-Transformer-tf2.0

transformer for ASR-systerm (via tensorflow2.0)

38
Emerging
42 ondrejklejch/learning_to_adapt

Coordinate-wise meta-learner for speaker adaptation of ASR models.

38
Emerging
43 HawkAaron/RNN-Transducer

MXNet implementation of RNN Transducer (Graves 2012): Sequence Transduction...

38
Emerging
44 manhph2211/ViSR

This repo builds an end-to-end deep learning application that supports...

38
Emerging
45 Sundy1219/eesen-for-thchs30

ASR for Chinese Mandarin

37
Emerging
46 clarinsi/Slovene_ASR_e2e

Automatic Speech Recognition tool

37
Emerging
47 nemoramo/acoustic_model

This is a sub-repository in building to create acoustic model in Mandarin...

37
Emerging
48 vectominist/End-to-end-ASR-Pytorch-DLHLP

Joint CTC-Attention End-to-end Speech Recognition - PyTorch Implementation...

37
Emerging
49 audioku/cross-accent-maml-asr

Meta-learning model agnostic (MAML) implementation for cross-accented ASR

37
Emerging
50 stevenhillis/awesome-asr-contextualization

A curated list of awesome papers on contextualizing E2E ASR outputs

37
Emerging
51 sooftware/lightning-asr

Modular and extensible speech recognition library leveraging...

36
Emerging
52 vectominist/MiniASR

A mini, simple, and fast end-to-end automatic speech recognition toolkit.

36
Emerging
53 vectominist/spin

Official code for Interspeech 2023 paper "Self-supervised Fine-tuning for...

35
Emerging
54 MingLunHan/CIF-PyTorch

[ICASSP 2020] CIF: Continuous Integrate-and-Fire for End-to-End Speech...

35
Emerging
55 sooftware/End-to-End-Speech-Recognition-Models

PyTorch implementation of automatic speech recognition models.

35
Emerging
56 daveshap/keras_asr

ASR experiment using Google's Universal Sentence Encoder

34
Emerging
57 ThetaOne-AI/HiKE

Hierarchical Korean-English Code-Switching Speech Recognition Benchmark...

34
Emerging
58 cdyangbo/end2endASR

implement end-to-end asr algorithm with tensorflow

34
Emerging
59 jiwidi/DeepSpeech-pytorch

Pytorch implementation for DeepSpeech 2.0

34
Emerging
60 RF5/transfusion-asr

Transcribing Speech with Multinomial Diffusion, training code and models.

33
Emerging
61 mravanelli/pytorch_MLP_for_ASR

This code implements a basic MLP for speech recognition. The MLP is trained...

33
Emerging
62 jindongwang/EasyEspnet

Making Espnet easier to use

33
Emerging
63 teamtee/LLM-ASR-Error-Correction

This is a framework for using large language models to improve ASR...

33
Emerging
64 biyoml/End-to-End-Mandarin-ASR

End-to-end speech recognition on AISHELL dataset.

32
Emerging
65 aalto-speech/speechbrain-cl

Implementation of different curriculum learning (CL) methods for...

32
Emerging
66 PigeonDan1/ps-slm

TASU: A New Style of Alignment of Speech LLM with only Text Training Data,...

31
Emerging
67 tongjinle123/speech-transformer-pytorch_lightning

ASR project with pytorch-lightning

31
Emerging
68 upskyy/ContextNet

PyTorch implementation of "ContextNet: Improving Convolutional Neural...

31
Emerging
69 viig99/esolafast

Fast C++ implementation of ESOLA using KFRLib, can be used for online...

31
Emerging
70 vectominist/rspin

Official inference code for NAACL 2024 paper "R-Spin: Efficient Speaker and...

31
Emerging
71 kouyt5/lightning-asr

基于pytorch-lighting框架搭建的端到端语音识别模型,目前还在实验中,性能在不断优化

31
Emerging
72 biyoml/PyTorch-End-to-End-ASR-on-TIMIT

Attention-based end-to-end ASR on TIMIT in PyTorch

30
Emerging
73 shockless/asr-transformer

Transformer for Automatic Speech Recognition

30
Emerging
74 DanielLin94144/Test-time-adaptation-ASR-SUTA

Test-time adaptation for speech recognition model by single utterance. The...

30
Emerging
75 nttcslab-sp/torchain

WIP: pytorch FFI wrapper for Kaldi chain loss (a.k.a. Lattice Free MMI)

29
Experimental
76 lucadellalib/ts-asr

Target speaker automatic speech recognition (TS-ASR)

29
Experimental
77 sunprinceS/MetaASR-CrossAccent

Meta-Learning for End-to-End ASR

28
Experimental
78 SpringerNLP/Chapter12

Chapter 12: End-to-end Speech Recognition

28
Experimental
79 upskyy/Automatic-Speech-Recognition-Models

End-to-End Korean Automatic Speech Recognition leveraging PyTorch and Hydra.

28
Experimental
80 yinruiqing/tiny-transducer

Tiny Transducer: A Highly-Efficient Speech Recognition Model on Edge Devices

28
Experimental
81 andybi7676/reborn-uasr

REBORN: Reinforcement-Learned Boundary Segmentation with Iterative Training...

27
Experimental
82 erasedwalt/CTC-ASR

An implementation of Jasper, QuartzNet, Citrinet and pipeline for training...

27
Experimental
83 awasthiabhijeet/Error-Driven-ASR-Personalization

Code for "Error-driven Fixed-Budget ASR Personalization for Accented...

27
Experimental
84 Kirili4ik/QuartzNet-ASR-pytorch

Automatic Speech Recognition (ASR) model QuartzNet trained on English...

27
Experimental
85 emonosuke/emoASR

End-to-end MOdeling of ASR (Automatic Speech Recognition)

26
Experimental
86 TeaPoly/AIF-PyTorch

(NOT Official) Implementation Auto-regressive Integrate-and-Fire (AIF)

25
Experimental
87 tuanio/deepspeech-ctc

Deepspeech with ctc loss on Vivos Vietnamese Dataset

25
Experimental
88 aws-samples/seq2seq-asr-misbehaves

Artifacts for the paper "Attentional Speech Recognition Models Misbehave on...

25
Experimental
89 msalhab96/RNN-Transducer

PyTorch implementation of Sequence Transduction with Recurrent Neural...

24
Experimental
90 tuanio/e2e-asr-toolkit

E2E Speech Recognition Toolkit with Hydra and Pytorch Lightning

24
Experimental
91 umitkacar/transformer-asr-transcription

Real-time transformer-based ASR supporting 100+ languages - Google Cloud...

24
Experimental
92 gheyret/uyghur-asr-transformer

Speech Recognition for Uyghur using Speech transformer

24
Experimental
93 mict-zhaw/chall_e2e_stt

End-to-end ASR experiments for language learning, focusing on...

23
Experimental
94 DuyguA/TSD2025-Mind-the-Gap

Innovative ASR model to keep named entities intact, offered as a conference paper.

23
Experimental
95 AssemblyAI-Community/intro-to-espnet

Getting Started with ESPnet | AssemblyAI

22
Experimental
96 pragyak412/Improving-Voice-Separation-by-Incorporating-End-To-End-Speech-Recognition

Implementing the paper -

22
Experimental
97 Lakshmi-bashyam/NeuralLM2Arpa

Implementation of conversion system : Neural Language models to backing off...

22
Experimental
98 chrarvi/automatic-speech-recognition

An automatic speech recognition transformer for converting swedish voice to text.

21
Experimental
99 dobby-seo/kosr

Korean speech recognition based on transformer (트랜스포머 기반 한국어 음성 인식)

21
Experimental
100 1ytic/edit-distance-papers

A curated list of papers dedicated to edit-distance as objective function

21
Experimental
101 AppleHolic/2020AIChallengeSpeechRecognition

2020 AI Challenge 음성 인식 코드

20
Experimental
102 shahad-mahmud/incremental_learning_for_asr

Incremental learning for automatic speech recognition (ASR)

20
Experimental
103 xingchensong/ASR-Wavnet

some ASR-system implementations (via tensorflow 1.x)

20
Experimental
104 MorrisXu-Driving/Improving_DeepSpeech_2_by_RNN_Transducer_Pytorch_Implementation

In this repository, based on Deep Speech 2, two losses, CTC and RNN-T are compared.

20
Experimental
105 khaykingleb/automatic-speech-recognition

QuartzNet and DeepSpeech implementation for ASR

19
Experimental
106 zyascend/End-to-End-Speech-Recognition-Learning

ASR, End-to-End, end2end, Speech Recognition, 端到端语音识别

19
Experimental
107 upskyy/RNN-Transducer

PyTorch Implementation of RNN-Transducer

19
Experimental
108 avrtt/MoE-speech-recognition

Mixture of experts architecture for speech-to-text and language...

19
Experimental
109 zw76859420/ASR_Transformer

A Pytorch implementation of Speech Transformer, an End-to-End Automatic...

18
Experimental
110 hannabdul/etf4asr

Official repo for the paper "An Effective Training Framework for...

13
Experimental
111 jfainberg/sincnet_adapt

Raw waveform adaptation with SincNet

13
Experimental
112 alifarrokh/asr-from-scratch

ASR models implemented from scratch in PyTorch

11
Experimental
113 muelitas/keywordRec

End-to-end speech recognition in Spanish and English using phonemes.

11
Experimental
114 anssssss/Vietnamese-Speech-Recognition

Vietnamese Speech Recognition using Connectionist Temporal Classification

11
Experimental
115 wjunneng/2020-Biendata-MagicSpeechNet-Family-Scene-Chinese-Speech-Data-Set-Challenge

2020 Biendata MagicSpeechNet 家庭场景中文语音数据集挑战赛

11
Experimental
116 hammaad2002/SimpleASRmodel

A simple CRDNN based ASR model for my own understanding of how ASR works and...

10
Experimental
117 molereddy/ditto-accent-asr

[ACL 2023] code for "DITTO: Data-efficient and Fair Targeted Subset...

10
Experimental