Text To Speech Frameworks Voice AI Tools

66 text-to-speech framework tools are tracked. 20 score 50 or higher (Established tier). The highest-rated is yeyupiaoling/MASR at 63/100 with 724 stars.

Get all 66 projects as JSON

curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=voice-ai&subcategory=text-to-speech-frameworks&limit=20"

Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000 requests/day.
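
The same request can be made from a script. The following is a minimal sketch using only the Python standard library; the helper names `build_url` and `fetch_projects` are illustrative, not part of the API. It assumes the endpoint returns a JSON body, but makes no assumption about the fields inside it. The URL above passes `limit=20`; whether larger values are accepted is not stated here, so the sketch keeps 20 as its default.

```python
import json
import urllib.parse
import urllib.request

# Endpoint taken from the curl command above.
BASE_URL = "https://pt-edge.onrender.com/api/v1/datasets/quality"


def build_url(domain="voice-ai", subcategory="text-to-speech-frameworks", limit=20):
    """Build the dataset URL with the same query parameters as the curl example."""
    query = urllib.parse.urlencode(
        {"domain": domain, "subcategory": subcategory, "limit": limit}
    )
    return f"{BASE_URL}?{query}"


def fetch_projects(limit=20, timeout=30):
    """Fetch the listing and decode it as JSON (response schema is not assumed)."""
    with urllib.request.urlopen(build_url(limit=limit), timeout=timeout) as resp:
        return json.load(resp)


# Example usage (performs a network request, counts against the daily quota):
# data = fetch_projects()
# print(json.dumps(data, indent=2)[:500])
```

Inspect the decoded response before relying on any particular key, since its structure is not documented on this page.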

#	Tool and description	Score (out of 100)	Tier	Stars	Language
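1	yeyupiaoling/MASR A PyTorch implementation of a streaming and non-streaming automatic speech recognition framework, compatible with both online and offline recognition; currently supports Conformer, Squeezeformer, DeepSpeech2...	63	Established	724	Python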
2	shivammehta25/Matcha-TTS [ICASSP 2024] 🍵 Matcha-TTS: A fast TTS architecture with conditional flow matching	59	Established	1,259	Jupyter Notebook
3	coqui-ai/TTS 🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research...	59	Established	44,801	Python
4	DigitalPhonetics/IMS-Toucan Controllable and fast Text-to-Speech for over 7000 languages!	58	Established	2,190	Python
5	gabrielmittag/NISQA NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment	58	Established	917	Python
6	shivammehta25/Neural-HMM Neural HMMs are all you need (for high-quality attention-free TTS)	54	Established	164	Jupyter Notebook
7	netease-youdao/EmotiVoice EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine	54	Established	8,455	Python
8	spring-media/TransformerTTS 🤖💬 Transformer TTS: Implementation of a non-autoregressive Transformer based...	51	Established	1,161	Python
9	keithito/tacotron A TensorFlow implementation of Google's Tacotron speech synthesis with...	51	Established	2,988	Python
10	soobinseo/Transformer-TTS A Pytorch Implementation of "Neural Speech Synthesis with Transformer Network"	51	Established	690	Python
11	jaywalnut310/glow-tts A Generative Flow for Text-to-Speech via Monotonic Alignment Search	51	Established	704	Python
12	descriptinc/melgan-neurips GAN-based Mel-Spectrogram Inversion Network for Text-to-Speech Synthesis	51	Established	1,037	Python
13	jik876/hifi-gan HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity...	51	Established	2,328	Python
14	r9y9/deepvoice3_pytorch PyTorch implementation of convolutional neural networks-based text-to-speech...	51	Established	1,982	Python
15	xcmyz/FastSpeech The Implementation of FastSpeech based on pytorch.	51	Established	880	Python
16	jackaduma/CycleGAN-VC2 Voice Conversion by CycleGAN (voice cloning/voice conversion): CycleGAN-VC2	50	Established	571	Python
17	jaywalnut310/vits VITS: Conditional Variational Autoencoder with Adversarial Learning for...	50	Established	7,837	Python
18	israelg99/deepvoice Deep Voice: Real-time Neural Text-to-Speech	50	Established	364	Python
19	yl4579/StarGANv2-VC StarGANv2-VC: A Diverse, Unsupervised, Non-parallel Framework for...	50	Established	518	Python
20	svc-develop-team/so-vits-svc SoftVC VITS Singing Voice Conversion	50	Established	28,008	Python
21	tugstugi/pytorch-dc-tts Text to Speech with PyTorch (English and Mongolian)	49	Emerging	187	Jupyter Notebook
22	NevilPatel01/RVC-WebUI-MacOS Optimized Retrieval-based Voice Conversion WebUI for Apple Silicon Macs...	49	Emerging	31	Python
23	p0p4k/vits2_pytorch unofficial vits2-TTS implementation in pytorch	49	Emerging	547	Python
24	metavoiceio/metavoice-src Foundational model for human-like, expressive TTS	49	Emerging	4,201	Python
25	google/tacotron Audio samples accompanying publications related to Tacotron, an end-to-end...	49	Emerging	539	HTML
26	gooofy/zerovox zero-shot realtime TTS system, fully offline, free and open source	48	Emerging	51	Python
27	jpuigcerver/Laia Laia: A deep learning toolkit for HTR based on Torch	48	Emerging	151	Shell
28	mozilla/TTS 🤖💬 Deep learning for Text to Speech (Discussion...	48	Emerging	10,123	Jupyter Notebook
29	LEEYOONHYUNG/BVAE-TTS Official implementation of BVAE-TTS	47	Emerging	173	Python
30	yl4579/StyleTTS Official Implementation of StyleTTS	46	Emerging	462	Python
31	ishandutta2007/Awesome-Text-to-Speech 🎤 A curated list of the latest and most influential tools, models, and...	46	Emerging	95	—
32	pritishyuvraj/Voice-Conversion-GAN Voice Conversion using Cycle GAN's For Non-Parallel Data	46	Emerging	125	Jupyter Notebook
33	nipponjo/tts-arabic-pytorch 🎙️ Arabic TTS models (Tacotron2, FastPitch)	45	Emerging	137	Jupyter Notebook
34	nnsvs/nnsvs Neural network-based singing voice synthesis library for research	45	Emerging	742	Python
35	daniilrobnikov/vits2 VITS2: Improving Quality and Efficiency of Single-Stage Text-to-Speech with...	45	Emerging	634	Jupyter Notebook
36	spring-media/DeepPhonemizer Grapheme to phoneme conversion with deep learning.	45	Emerging	421	Python
37	maum-ai/univnet Unofficial PyTorch Implementation of UnivNet Vocoder...	45	Emerging	282	Python
38	coqui-ai/TTS-papers 🐸 collection of TTS papers	44	Emerging	723	—
39	persephone-tools/persephone A tool for automatic phoneme transcription	44	Emerging	159	Python
40	r9y9/ttslearn ttslearn: Library for Pythonで学ぶ音声合成 (Text-to-speech with Python)	44	Emerging	267	Jupyter Notebook
41	keonlee9420/Comprehensive-Transformer-TTS A Non-Autoregressive Transformer based Text-to-Speech, supporting a family...	44	Emerging	328	Python
42	maum-ai/assem-vc Official Code for Assem-VC @ICASSP2022	44	Emerging	269	Jupyter Notebook
43	p0p4k/pflowtts_pytorch Unofficial implementation of NVIDIA P-Flow TTS paper	43	Emerging	230	Python
44	karim23657/Persian-tts-coqui Persian/Farsi text to speech(TTS) training using coqui tts	43	Emerging	199	Jupyter Notebook
45	yl4579/StyleTTS-VC Official Implementation of StyleTTS-VC	43	Emerging	197	Python
46	keonlee9420/Comprehensive-Tacotron2 PyTorch Implementation of Google's Natural TTS Synthesis by Conditioning...	42	Emerging	48	Python
47	huckiyang/Voice2Series-Reprogramming ICML 21 - Voice2Series: Adversarial Reprogramming Acoustic Models for Time...	41	Emerging	73	TypeScript
48	hhguo/MSMC-TTS Official Implementation of Multi-Stage Multi-Codebook (MSMC) TTS	41	Emerging	169	Python
49	yl4579/HiFTNet HiFTNet: A Fast High-Quality Neural Vocoder with Harmonic-plus-Noise Filter...	41	Emerging	247	Python
50	sophiefy/StellaVoiceChanger Deep-learning-based voice changer, supporting local inference.	40	Emerging	96	Python
51	double22a/asr_nlp_paper_code Papers of ASR, Tools of ASR	40	Emerging	41	—
52	SungFeng-Huang/Meta-TTS Official repository of https://doi.org/10.1109/TASLP.2022.3167258. More...	39	Emerging	194	Python
53	alessandroragano/scoreq SCOREQ: Speech COntrastive REgression for Quality Assessment (NeurIPS 2024)	39	Emerging	108	Python
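54	binzhouchn/masr Chinese speech recognition series; readers can use it to quickly train their own Chinese speech recognition models, or test results directly with the pretrained models.	37	Emerging	285	Python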
55	HuuHuy227/XphoneBert_Vits2 VITS2 extended with XPhoneBERT encoder	35	Emerging	10	Python
56	jreremy/conformer Pytorch implementation of conformer with training script for end-to-end...	35	Emerging	28	Python
57	keonlee9420/Comprehensive-E2E-TTS A Non-Autoregressive End-to-End Text-to-Speech (text-to-wav), supporting a...	34	Emerging	146	Python
58	nafiuny/ICRCycleGAN-VC Non-parallel voice conversion called ICRCycleGAN-VC based on CycleGAN and...	33	Emerging	15	Python
59	ShawnPi233/HQ-SVC Official Repository of Paper: "Towards High-Quality Zero-Shot Singing Voice...	32	Emerging	91	Python
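60	zmeet-ai/tts-demo Supports male and female voices with a range of emotions; supports real-time and offline text-to-speech synthesis; supports single-model voice changing, speech rate adjustment, and volume adjustment; supports custom voice models.	31	Emerging	70	Java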
61	sil-ai/tts-singlish TTS for Singlish using Tacotron2, the IMDA corpus, and Pachyderm.	27	Experimental	11	Python
62	juanjosehr14/YingMusic-SVC 🎤 Transform singing voices effortlessly with YingMusic-SVC, a robust...	22	Experimental	1	—
63	mende237/Nda-Nda-Force-Aligner Forced alignment of Nda'Nda', a Cameroonian language	21	Experimental	3	Shell
64	MahdeenSky/SoftVC-VITS-MusicSingerChanger Google Colab for testing SoftVC VITS Singing Voice Conversion for AI...	19	Experimental	13	Jupyter Notebook
65	felipeoliverai/conformer-paper PyTorch implementation of the paper: Conformer: Convolution-augmented...	17	Experimental	1	Python
66	nipponjo/mixer-tts-pytorch Mixer-TTS for efficient TTS	12	Experimental	5	Jupyter Notebook
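
For readers who copy the table as tab-separated text, the following is a rough parsing sketch. It assumes six tab-separated fields per row, with the repository name and its description sharing the second field and separated by the first space; `parse_row` is an illustrative helper, and the sample lines are rows 20, 21, and 66 from the table above.

```python
from collections import Counter


def parse_row(line):
    """Split one tab-separated table row into typed fields."""
    rank, tool, score, tier, stars, language = line.rstrip("\n").split("\t")
    # The tool column holds "owner/repo description"; split at the first space.
    repo, _, description = tool.partition(" ")
    return {
        "rank": int(rank),
        "repo": repo,
        "description": description,
        "score": int(score),
        "tier": tier,
        "stars": int(stars.replace(",", "")),  # drop thousands separators
        "language": language,
    }


sample = [
    "20\tsvc-develop-team/so-vits-svc SoftVC VITS Singing Voice Conversion\t50\tEstablished\t28,008\tPython",
    "21\ttugstugi/pytorch-dc-tts Text to Speech with PyTorch (English and Mongolian)\t49\tEmerging\t187\tJupyter Notebook",
    "66\tnipponjo/mixer-tts-pytorch Mixer-TTS for efficient TTS\t12\tExperimental\t5\tJupyter Notebook",
]

rows = [parse_row(line) for line in sample]
tier_counts = Counter(row["tier"] for row in rows)
```

Run over all 66 rows, the same tally gives 20 Established, 40 Emerging, and 6 Experimental entries.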

Comparisons in this category

TTS and glow-tts (59 vs 51)
deepvoice3_pytorch and deepvoice (51 vs 50)
TransformerTTS and Transformer-TTS (51 vs 51)
CycleGAN-VC2 and StarGANv2-VC (50 vs 50)