Text-to-Speech (TTS) Transformer Models
Tools for converting written text into spoken audio using transformer models and neural vocoding. Includes TTS engines, voice synthesis systems, and voice cloning capabilities. Does NOT include speech recognition, speech-to-text, audio classification, or general audio processing without text input.
23 text-to-speech (TTS) models are tracked. Three score above 50 (Established tier). The highest-rated is edwko/OuteTTS, at 55/100 with 1,429 stars.
Get all 23 projects as JSON
curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=transformers&subcategory=text-to-speech-tts&limit=20"
Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000 requests/day.
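The same request can be made from a script. The following is a minimal sketch using only the Python standard library. The endpoint and query parameters are copied from the curl command above; the helper names `build_url` and `fetch_projects` are illustrative, and the shape of the returned JSON is not documented here, so the sketch only parses and prints it rather than assuming any field names.

```python
import json
import urllib.parse
import urllib.request

# Endpoint taken from the curl command above.
BASE_URL = "https://pt-edge.onrender.com/api/v1/datasets/quality"


def build_url(domain="transformers", subcategory="text-to-speech-tts", limit=20):
    """Build the dataset URL with the same query parameters as the curl example."""
    query = urllib.parse.urlencode(
        {"domain": domain, "subcategory": subcategory, "limit": limit}
    )
    return f"{BASE_URL}?{query}"


def fetch_projects(url, timeout=30):
    """Fetch the URL and parse the body as JSON.

    The response schema is an unknown here, so the parsed value is
    returned as-is without assuming any keys.
    """
    with urllib.request.urlopen(url, timeout=timeout) as resp:
        return json.loads(resp.read().decode("utf-8"))


if __name__ == "__main__":
    data = fetch_projects(build_url())
    # Print only the start of the payload to inspect its structure.
    print(json.dumps(data, indent=2)[:500])
```

The `limit=20` default mirrors the command above, which is fewer than the 23 tracked projects. Whether the API accepts a larger limit or offers paging is not stated on this page, so treat any change to that value as an experiment.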
| # | Model | Score | Tier |
|---|---|---|---|
| 1 | edwko/OuteTTS: Interface for OuteTTS models. | 55 | Established |
| 2 | fluxions-ai/vui: 100M parameter lightweight conversational text-to-speech model with breaths,... | | Established |
| 3 | OpenVoiceOS/ovos-audio-transformer-plugin-ggwave: data over sound plugin | | Established |
| 4 | inboxpraveen/LLM-Minutes-of-Meeting: 🎤📄 An innovative tool that transforms audio or video files into text... | | Emerging |
| 5 | mbzuai-oryx/LLMVoX: LLMVoX: Autoregressive Streaming Text-to-Speech Model for Any LLM | | Emerging |
| 6 | Aratako/T5Gemma-TTS: Multilingual TTS model with voice cloning and duration control, based on... | | Emerging |
| 7 | maciekt07/Lecture-Note-Generator-POC: 📒 A proof-of-concept app that transcribes lecture recordings into text and... | | Emerging |
| 8 | skjp/spout: Workspace Repo for Synergistic Plugins Optimizing Usability of Transformers (Spout) | | Emerging |
| 9 | tahaabbas/dictator: Dictator – Supercharge Cursor Chat with voice-to-text, custom AI prompts,... | | Emerging |
| 10 | arifulislamat/local-voice-cloning-app: Powered by ChatterboxTTS \| Transformer \| Llama \| Gradio | | Experimental |
| 11 | mwasifanwar/VoiceClone-Pro: Advanced voice cloning and speech synthesis system that can mimic any voice... | | Experimental |
| 12 | jaden3289/llasa-tts-8b-webui: 🎙️ Generate high-quality speech from text with Llasa-TTS-8B, featuring... | | Experimental |
| 13 | Swap98-Coder/mlx-audio: A text-to-speech (TTS), speech-to-text (STT) and speech-to-speech (STS)... | | Experimental |
| 14 | eray-yuztyurk/python-ai-text-to-speech: Multilingual text-to-speech and text summarization toolkit using... | | Experimental |
| 15 | fahimakhalifa/ai-notes-api: Authenticated Notes API with Hugging Face summarization, sentiment analysis,... | | Experimental |
| 16 | oscargullberg/tldwol: Web API that summarizes multimedia from various sources using modern AI tools. | | Experimental |
| 17 | yamanobora/Android-Offline-Meeting-Recorder: Android app for offline speech recognition and AI meeting summarization... | | Experimental |
| 18 | arafat2020/cut_py: An R&D project to cut the best part from a video using AI and ffmpeg. | | Experimental |
| 19 | thewh1teagle/sheen: LLM based TTS using Qwen and the SNAC audio codec | | Experimental |
| 20 | vijay0320/meeting-notes-cleaner: NLP pipeline fine-tuning flan-t5-small on meeting transcripts. 99.7%... | | Experimental |
| 21 | Afhrodite/Audio-LLM-Playground: A collection of audio transcription and summarization tools developed during... | | Experimental |
| 22 | nurgalive/nurgavoice: AI Transcription & Summarization Service built with open-source models. | | Experimental |
| 23 | egorsmkv/speech-to-text-using-php: Use PHP for Speech-to-Text task. Just a research. | | Experimental |