FastSpeech TTS Models Voice AI Tools

PyTorch implementations and variants of the FastSpeech and FastSpeech2 architectures for neural text-to-speech synthesis. Does NOT include other TTS architectures (Transformer-TTS, Glow-TTS), vocoder implementations, or speech synthesis models not based on FastSpeech.

This category tracks 74 FastSpeech TTS model tools. Three score 50 or higher (Established tier). The highest-rated is TensorSpeech/TensorFlowTTS at 60/100 with 3,995 stars.

Get all 74 projects as JSON

curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=voice-ai&subcategory=fastspeech-tts-models&limit=20"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
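For scripted access, the same request can be sketched in Python using only the standard library. This is a minimal sketch, not an official client: the endpoint and the three query parameters are copied from the curl command above, while the helper names `build_url` and `fetch_projects` are hypothetical. The shape of the JSON response is not documented here, so the code returns the decoded body without assuming any field names. Whether `limit` accepts values above 20 is also not stated on this page.

```python
import json
import urllib.parse
import urllib.request

# Endpoint taken from the curl command on this page.
BASE_URL = "https://pt-edge.onrender.com/api/v1/datasets/quality"


def build_url(domain="voice-ai", subcategory="fastspeech-tts-models", limit=20):
    """Assemble the request URL (hypothetical helper; defaults mirror the curl example)."""
    query = urllib.parse.urlencode(
        {"domain": domain, "subcategory": subcategory, "limit": limit}
    )
    return f"{BASE_URL}?{query}"


def fetch_projects(url, timeout=30):
    """Send a GET request and decode the body as JSON.

    The response structure is not assumed; the caller receives whatever
    the service returns (a list or a dict).
    """
    with urllib.request.urlopen(url, timeout=timeout) as response:
        return json.load(response)
```

Calling `fetch_projects(build_url())` issues one request against the keyless daily quota mentioned above, so cache the result locally rather than refetching in a loop.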

| # | Tool | Description | Score | Tier |
|---|------|-------------|-------|------|
| 1 | TensorSpeech/TensorFlowTTS | TensorFlowTTS: Real-Time State-of-the-art... | 60 | Established |
| 2 | lucasnewman/nanospeech | A simple, hackable text-to-speech system in PyTorch and MLX | 52 | Established |
| 3 | Tomiinek/Multilingual_Text_to_Speech | An implementation of Tacotron 2 that supports multilingual experiments with... | 50 | Established |
| 4 | keonlee9420/STYLER | Official repository of STYLER: Style Factor Modeling with Rapidity and... | 48 | Emerging |
| 5 | jxzhanggg/nonparaSeq2seqVC_code | Implementation code of non-parallel sequence-to-sequence VC | 48 | Emerging |
| 6 | rishikksh20/FastSpeech2 | PyTorch Implementation of FastSpeech 2: Fast and High-Quality End-to-End... | 48 | Emerging |
| 7 | yl4579/PL-BERT | Phoneme-Level BERT for Enhanced Prosody of Text-to-Speech with Grapheme Predictions | 48 | Emerging |
| 8 | saiteja-talluri/Speech2Face | Implementation of the CVPR 2019 Paper - Speech2Face: Learning the Face... | 47 | Emerging |
| 9 | roatienza/efficientspeech | PyTorch code implementation of EfficientSpeech - to be presented at ICASSP 2023. | 47 | Emerging |
| 10 | rishikksh20/AdaSpeech | AdaSpeech: Adaptive Text to Speech for Custom Voice | 47 | Emerging |
| 11 | atomicoo/tacotron2-mandarin | TensorFlow implementation of Chinese/Mandarin TTS (Text-to-Speech) based on... | 47 | Emerging |
| 12 | keonlee9420/Expressive-FastSpeech2 | PyTorch Implementation of Non-autoregressive Expressive (emotional,... | 46 | Emerging |
| 13 | ORI-Muchim/Efficient-Speech | Lightweight Korean TTS Model based on FastSpeech2 | 46 | Emerging |
| 14 | neosapience/mlp-singer | Official implementation of MLP Singer: Towards Rapid Parallel Korean Singing... | 46 | Emerging |
| 15 | atomicoo/FCH-TTS | A fast Text-to-Speech (TTS) model. Works well for English, Mandarin/Chinese,... | 46 | Emerging |
| 16 | CSTR-Edinburgh/magphase | MagPhase Vocoder: Speech analysis/synthesis system for TTS and related applications. | 45 | Emerging |
| 17 | KevinMIN95/StyleSpeech | Official implementation of Meta-StyleSpeech and StyleSpeech | 45 | Emerging |
| 18 | NATSpeech/NATSpeech | A Non-Autoregressive Text-to-Speech (NAR-TTS) framework, including official... | 45 | Emerging |
| 19 | caizexin/tf_multispeakerTTS_fc | The TensorFlow version of multi-speaker TTS training with feedback constraint | 43 | Emerging |
| 20 | Rongjiehuang/GenerSpeech | PyTorch Implementation of GenerSpeech (NeurIPS'22): a text-to-speech model... | 43 | Emerging |
| 21 | Rongjiehuang/Multi-Singer | PyTorch Implementation of Multi-Singer (ACM-MM'21) | 42 | Emerging |
| 22 | keonlee9420/Daft-Exprt | PyTorch Implementation of Daft-Exprt: Robust Prosody Transfer Across... | 42 | Emerging |
| 23 | keonlee9420/FastPitchFormant | PyTorch Implementation of NCSOFT's FastPitchFormant: Source-filter based... | 42 | Emerging |
| 24 | mush42/optispeech | A lightweight end-to-end text-to-speech model | 42 | Emerging |
| 25 | keonlee9420/StyleSpeech | PyTorch Implementation of Meta-StyleSpeech: Multi-Speaker Adaptive... | 42 | Emerging |
| 26 | neosapience/editts | Official implementation of EdiTTS: Score-based Editing for Controllable... | 42 | Emerging |
| 27 | ranchlai/mandarin-tts | Chinese Mandarin TTS (text-to-speech), Chinese (Mandarin) speech synthesis, by FastSpeech 2,... | 42 | Emerging |
| 28 | Labmem-Zhouyx/CDFSE_FastSpeech2 | The Official Implementation of “Content-Dependent Fine-Grained Speaker... | 40 | Emerging |
| 29 | yui-mhcp/text_to_speech | (Multi Speaker) Text-To-Speech (TTS) project | 40 | Emerging |
| 30 | hwRG/End-to-End-TTS-Fine-Tune | Use FastSpeech2 and HiFi-GAN to easily perform end-to-end Korean speech synthesis. | 40 | Emerging |
| 31 | lucasnewman/vocos-mlx | Implementation of 'Vocos: Closing the gap between time-domain and... | 40 | Emerging |
| 32 | gmltmd789/UnitSpeech | An official implementation of "UnitSpeech: Speaker-adaptive Speech Synthesis... | 39 | Emerging |
| 33 | Executedone/Chinese-FastSpeech2 | Further training on Biaobei (标贝) data, with improvements to the original FastSpeech2 model: adds prosody representation and prosody prediction modules that make Chinese pronunciation more vivid and rhythmic | 39 | Emerging |
| 34 | lars76/fastspeech2-clean | Clean and modernized implementation of FastSpeech2/LightSpeech using IPA | 38 | Emerging |
| 35 | andi611/ZeroSpeech-TTS-without-T | A PyTorch implementation for the ZeroSpeech 2019 challenge. | 38 | Emerging |
| 36 | msalhab96/MultiSpeech | PyTorch implementation for MultiSpeech: Multi-Speaker Text to Speech with... | 37 | Emerging |
| 37 | Adibian/ResGrad | Unofficial implementation of ResGrad: Residual Denoising Diffusion... | 37 | Emerging |
| 38 | tuanh123789/AdaSpeech | An implementation of Microsoft's "AdaSpeech: Adaptive Text to Speech for... | 37 | Emerging |
| 39 | ga642381/FastSpeech2 | Multi-Speaker PyTorch FastSpeech2: Fast and High-Quality End-to-End Text to... | 36 | Emerging |
| 40 | rishikksh20/LightSpeech | LightSpeech: Lightweight and Fast Text to Speech with Neural Architecture Search | 36 | Emerging |
| 41 | adasegroup/OSM-one-shot-multispeaker | Framework for one-shot multispeaker system based on Deep Learning | 34 | Emerging |
| 42 | OpenTSLab/BELLE | Official implementation of BELLE "Bayesian Speech Synthesizers Can Learn... | 34 | Emerging |
| 43 | akashmjn/cs224n-gpu-that-talks | Attention, I'm Trying to Speak: End-to-end speech synthesis (CS224n '18) | 34 | Emerging |
| 44 | ShivamRajSharma/Transformer-Text-To-Speech | PyTorch implementation of Transformer-TTS for converting text into speech. | 34 | Emerging |
| 45 | deepkyu/ml-talking-face | Cloned repository from Hugging Face Spaces (CVPR 2022 Demo) | 34 | Emerging |
| 46 | eazhary/dctts2 | Deep Convolution Text to Speech | 33 | Emerging |
| 47 | lucasnewman/e2-tts-mlx | Implementation of E2-TTS, "Embarrassingly Easy Fully Non-Autoregressive... | 33 | Emerging |
| 48 | revsic/tf-glow-tts | TensorFlow implementation of Glow-TTS | 33 | Emerging |
| 49 | revsic/tf-mlptts | TensorFlow implementation of MLP-Mixer based TTS | 33 | Emerging |
| 50 | hwRG/FastSpeech2-Pytorch-Korean-Multi-Speaker | Multi-Speaker FastSpeech2 applicable to Korean. Description about train and... | 33 | Emerging |
| 51 | mush42/leanspeech | Unofficial PyTorch implementation of LeanSpeech: The Microsoft Lightweight... | 32 | Emerging |
| 52 | yanghaha0908/FastHuBERT | Official implementation for Fast-HuBERT: An Efficient Training Framework for... | 32 | Emerging |
| 53 | dacson/Demo-of-Text-to-Speech-based-on-Deep-Learning | Text-to-speech for Mandarin | 31 | Emerging |
| 54 | erogol/ddc-samples | 🐸💬 Coqui TTS Double Decoder Consistency samples | 31 | Emerging |
| 55 | xcmyz/FastSpeech2 | The Implementation of FastSpeech2 Based on PyTorch. | 31 | Emerging |
| 56 | X-LANCE/VoiceFlow-TTS | [ICASSP 2024] This is the official code for "VoiceFlow: Efficient... | 31 | Emerging |
| 57 | AppleHolic/FastSpeech2 | Refactored version of https://github.com/ming024/FastSpeech2 | 31 | Emerging |
| 58 | QinHsiu/BiCLTTS | Bi-level Contrastive Learning for Text-to-Speech | 29 | Experimental |
| 59 | X-LANCE/UniCATS-CTX-txt2vec | [AAAI 2024] CTX-txt2vec, the acoustic model in UniCATS | 29 | Experimental |
| 60 | aqibahmad/speech2face | A PyTorch implementation of MIT CSAIL's Speech2Face research paper from IEEE... | 29 | Experimental |
| 61 | ssmlkl/MnTTS2 | This is the experimental description of MnTTS2. | 28 | Experimental |
| 62 | monatis/german-tts | German Tacotron 2 and Multi-band MelGAN in TensorFlow with TF Lite inference support | 28 | Experimental |
| 63 | carankt/FastSpeech2 | Implementation of FastSpeech 2 | 26 | Experimental |
| 64 | WWWWxp/M3-TTS | PyTorch Implementation of the paper "M3-TTS: Multi-modal DiT Alignment &... | 26 | Experimental |
| 65 | ssumin6/Korean-TTS-Server | Korean text-to-speech | 25 | Experimental |
| 66 | erogol/TTS_tf | WIP TensorFlow implementation of https://github.com/mozilla/TTS | 24 | Experimental |
| 67 | clarenceluo78/singer-adaptive-svc | This repository is the implementation of project Converting to Realistic... | 24 | Experimental |
| 68 | Orca0917/TransformerTTS | Unofficial PyTorch implementation of Transformer-TTS, a Transformer-based... | 22 | Experimental |
| 69 | keonlee9420/Deep-Learning-TTS-Template | This is a template for the Non-autoregressive Deep Learning-Based TTS model... | 21 | Experimental |
| 70 | kowaalczyk/reformer-tts | An adaptation of Reformer: The Efficient Transformer for text-to-speech task. | 21 | Experimental |
| 71 | zabir-nabil/fast-wavenet-mel2wav | Dummy Implementation, Will update later | 17 | Experimental |
| 72 | asiff00/TTS-Training-Blueprint | Intuitive understanding of Autoregressive TTS Models | 16 | Experimental |
| 73 | gateoneh92/Flow-Matching-TTS | ⚡ Non-autoregressive TTS using Conditional Flow Matching - 5-20x faster than... | 14 | Experimental |
| 74 | davidalvarezdlt/samplernn_pase | Implementation of the paper "Problem-agnostic speech embeddings for... | 11 | Experimental |