edwko/OuteTTS
Interface for OuteTTS models.
This tool helps individuals or businesses convert written text into natural-sounding speech. You input text and a short audio sample of a desired voice, and it outputs an audio file speaking your text in that voice. It's ideal for content creators, educators, or anyone needing custom voiceovers or audio content.
1,429 stars. No commits in the last 6 months. Available on PyPI.
Use this if you need to generate high-quality, custom voice narration from text, especially if you want the generated speech to match a specific speaker's voice, emotion, style, and accent.
Not ideal if you need a simple text-to-speech solution without custom voice cloning or if your target language is not well-represented by your speaker reference.
Stars: 1,429
Forks: 113
Language: Python
License: Apache-2.0
Category:
Last pushed: Jun 21, 2025
Commits (30d): 0
Dependencies: 28
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/transformers/edwko/OuteTTS"
Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000/day.
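The same request can be made from a script. The sketch below uses only the Python standard library and the endpoint from the curl command above. The helper names `quality_url` and `fetch_quality` are hypothetical, and the code assumes the response body is JSON, which this page does not confirm. How an API key is supplied is not documented here, so the sketch covers only the keyless tier.

```python
import json
import urllib.request
from urllib.parse import quote

# Base path taken from the curl command on this page.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality/transformers"


def quality_url(owner: str, repo: str) -> str:
    """Build the endpoint URL for one repository (hypothetical helper)."""
    # Percent-encode each path segment so unusual names cannot break the URL.
    return f"{BASE_URL}/{quote(owner, safe='')}/{quote(repo, safe='')}"


def fetch_quality(owner: str, repo: str, timeout: float = 10.0):
    """Fetch the record for one repository.

    Assumption: the endpoint returns a JSON body. If it does not,
    json.loads raises a ValueError that the caller should handle.
    """
    request = urllib.request.Request(
        quality_url(owner, repo),
        headers={"Accept": "application/json"},
    )
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))
```

Calling `fetch_quality("edwko", "OuteTTS")` issues the same GET request as the curl command. With the keyless limit of 100 requests/day, it is worth caching the result locally rather than refetching on every run.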
Related models
fluxions-ai/vui
100M parameter lightweight conversational text-to-speech model with breaths, laughter,...
OpenVoiceOS/ovos-audio-transformer-plugin-ggwave
data over sound plugin
inboxpraveen/LLM-Minutes-of-Meeting
🎤📄 An innovative tool that transforms audio or video files into text transcripts and generates...
mbzuai-oryx/LLMVoX
LLMVoX: Autoregressive Streaming Text-to-Speech Model for Any LLM
Aratako/T5Gemma-TTS
Multilingual TTS model with voice cloning and duration control, based on T5Gemma encoder-decoder LLM