Text To Speech Frameworks Voice AI Tools

There are 66 text to speech frameworks tools tracked. 20 score above 50 (established tier). The highest-rated is yeyupiaoling/MASR at 63/100 with 724 stars.

Get all 66 projects as JSON

curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=voice-ai&subcategory=text-to-speech-frameworks&limit=20"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.

# Tool Score Tier
1 yeyupiaoling/MASR

Pytorch实现的流式与非流式的自动语音识别框架,同时兼容在线和离线识别,目前支持Conformer、Squeezeformer、DeepSpeech2...

63
Established
2 shivammehta25/Matcha-TTS

[ICASSP 2024] 🍵 Matcha-TTS: A fast TTS architecture with conditional flow matching

59
Established
3 coqui-ai/TTS

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research...

59
Established
4 DigitalPhonetics/IMS-Toucan

Controllable and fast Text-to-Speech for over 7000 languages!

58
Established
5 gabrielmittag/NISQA

NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment

58
Established
6 shivammehta25/Neural-HMM

Neural HMMs are all you need (for high-quality attention-free TTS)

54
Established
7 netease-youdao/EmotiVoice

EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine

54
Established
8 spring-media/TransformerTTS

🤖💬 Transformer TTS: Implementation of a non-autoregressive Transformer based...

51
Established
9 keithito/tacotron

A TensorFlow implementation of Google's Tacotron speech synthesis with...

51
Established
10 soobinseo/Transformer-TTS

A Pytorch Implementation of "Neural Speech Synthesis with Transformer Network"

51
Established
11 jaywalnut310/glow-tts

A Generative Flow for Text-to-Speech via Monotonic Alignment Search

51
Established
12 descriptinc/melgan-neurips

GAN-based Mel-Spectrogram Inversion Network for Text-to-Speech Synthesis

51
Established
13 jik876/hifi-gan

HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity...

51
Established
14 r9y9/deepvoice3_pytorch

PyTorch implementation of convolutional neural networks-based text-to-speech...

51
Established
15 xcmyz/FastSpeech

The Implementation of FastSpeech based on pytorch.

51
Established
16 jackaduma/CycleGAN-VC2

Voice Conversion by CycleGAN (语音克隆/语音转换): CycleGAN-VC2

50
Established
17 jaywalnut310/vits

VITS: Conditional Variational Autoencoder with Adversarial Learning for...

50
Established
18 israelg99/deepvoice

Deep Voice: Real-time Neural Text-to-Speech

50
Established
19 yl4579/StarGANv2-VC

StarGANv2-VC: A Diverse, Unsupervised, Non-parallel Framework for...

50
Established
20 svc-develop-team/so-vits-svc

SoftVC VITS Singing Voice Conversion

50
Established
21 tugstugi/pytorch-dc-tts

Text to Speech with PyTorch (English and Mongolian)

49
Emerging
22 NevilPatel01/RVC-WebUI-MacOS

Optimized Retrieval-based Voice Conversion WebUI for Apple Silicon Macs...

49
Emerging
23 p0p4k/vits2_pytorch

unofficial vits2-TTS implementation in pytorch

49
Emerging
24 metavoiceio/metavoice-src

Foundational model for human-like, expressive TTS

49
Emerging
25 google/tacotron

Audio samples accompanying publications related to Tacotron, an end-to-end...

49
Emerging
26 gooofy/zerovox

zero-shot realtime TTS system, fully offline, free and open source

48
Emerging
27 jpuigcerver/Laia

Laia: A deep learning toolkit for HTR based on Torch

48
Emerging
28 mozilla/TTS

:robot: :speech_balloon: Deep learning for Text to Speech (Discussion...

48
Emerging
29 LEEYOONHYUNG/BVAE-TTS

Official implementation of BVAE-TTS

47
Emerging
30 yl4579/StyleTTS

Official Implementation of StyleTTS

46
Emerging
31 ishandutta2007/Awesome-Text-to-Speech

🎤 A curated list of the latest and most influential tools, models, and...

46
Emerging
32 pritishyuvraj/Voice-Conversion-GAN

Voice Conversion using Cycle GAN's For Non-Parallel Data

46
Emerging
33 nipponjo/tts-arabic-pytorch

🎙️ Arabic TTS models (Tacotron2, FastPitch)

45
Emerging
34 nnsvs/nnsvs

Neural network-based singing voice synthesis library for research

45
Emerging
35 daniilrobnikov/vits2

VITS2: Improving Quality and Efficiency of Single-Stage Text-to-Speech with...

45
Emerging
36 spring-media/DeepPhonemizer

Grapheme to phoneme conversion with deep learning.

45
Emerging
37 maum-ai/univnet

Unofficial PyTorch Implementation of UnivNet Vocoder...

45
Emerging
38 coqui-ai/TTS-papers

🐸 collection of TTS papers

44
Emerging
39 persephone-tools/persephone

A tool for automatic phoneme transcription

44
Emerging
40 r9y9/ttslearn

ttslearn: Library for Pythonで学ぶ音声合成 (Text-to-speech with Python)

44
Emerging
41 keonlee9420/Comprehensive-Transformer-TTS

A Non-Autoregressive Transformer based Text-to-Speech, supporting a family...

44
Emerging
42 maum-ai/assem-vc

Official Code for Assem-VC @ICASSP2022

44
Emerging
43 p0p4k/pflowtts_pytorch

Unofficial implementation of NVIDIA P-Flow TTS paper

43
Emerging
44 karim23657/Persian-tts-coqui

Persian/Farsi text to speech(TTS) training using coqui tts

43
Emerging
45 yl4579/StyleTTS-VC

Official Implementation of StyleTTS-VC

43
Emerging
46 keonlee9420/Comprehensive-Tacotron2

PyTorch Implementation of Google's Natural TTS Synthesis by Conditioning...

42
Emerging
47 huckiyang/Voice2Series-Reprogramming

ICML 21 - Voice2Series: Adversarial Reprogramming Acoustic Models for Time...

41
Emerging
48 hhguo/MSMC-TTS

Official Implement of Multi-Stage Multi-Codebook (MSMC) TTS

41
Emerging
49 yl4579/HiFTNet

HiFTNet: A Fast High-Quality Neural Vocoder with Harmonic-plus-Noise Filter...

41
Emerging
50 sophiefy/StellaVoiceChanger

Deep-learning-based voice changer, supporting local inference.

40
Emerging
51 double22a/asr_nlp_paper_code

Papers of ASR, Tools of ASR

40
Emerging
52 SungFeng-Huang/Meta-TTS

Official repository of https://doi.org/10.1109/TASLP.2022.3167258. More...

39
Emerging
53 alessandroragano/scoreq

SCOREQ: Speech COntrastive REgression for Quality Assessment (NeurIPS 2024)

39
Emerging
54 binzhouchn/masr

中文语音识别系列,读者可以借助它快速训练属于自己的中文语音识别模型,或直接使用预训练模型测试效果。

37
Emerging
55 HuuHuy227/XphoneBert_Vits2

VITS2 extended with XPhoneBERT encoder

35
Emerging
56 jreremy/conformer

Pytorch implementation of conformer with with training script for end-to-end...

35
Emerging
57 keonlee9420/Comprehensive-E2E-TTS

A Non-Autoregressive End-to-End Text-to-Speech (text-to-wav), supporting a...

34
Emerging
58 nafiuny/ICRCycleGAN-VC

Non-parallel voice conversion called ICRCycleGAN-VC based on CycleGAN and...

33
Emerging
59 ShawnPi233/HQ-SVC

Official Repository of Paper: "Towards High-Quality Zero-Shot Singing Voice...

32
Emerging
60 zmeet-ai/tts-demo

支持各种感情的男女声音,支持实时和离线文本合成tts语音;支持单模特声音变声,语音速率调整,语音音量大小调整;支持自定义语音模型。

31
Emerging
61 sil-ai/tts-singlish

TTS for Singlish using Tacotron2, the IMDA corpus, and Pachyderm.

27
Experimental
62 juanjosehr14/YingMusic-SVC

🎤 Transform singing voices effortlessly with YingMusic-SVC, a robust...

22
Experimental
63 mende237/Nda-Nda-Force-Aligner

Forced alignment of Nda‘ Nda’ a Cameroonian language

21
Experimental
64 MahdeenSky/SoftVC-VITS-MusicSingerChanger

Google collab for testing SoftVC VITS Singing Voice Conversion for AI...

19
Experimental
65 felipeoliverai/conformer-paper

PyTorch implementation of the paper: 𝐂𝐨𝐧𝐟𝐨𝐫𝐦𝐞𝐫: 𝐂𝐨𝐧𝐯𝐨𝐥𝐮𝐭𝐢𝐨𝐧-𝐚𝐮𝐠𝐦𝐞𝐧𝐭𝐞𝐝...

17
Experimental
66 nipponjo/mixer-tts-pytorch

Mixer-TTS for efficient TTS

12
Experimental