edwko/OuteTTS

Interface for OuteTTS models.

55 / 100

Established

This tool helps individuals and businesses convert written text into natural-sounding speech. You provide text and a short audio sample of the desired voice, and it outputs an audio file of that voice speaking your text. It's suited to content creators, educators, and anyone who needs custom voiceovers or audio content.

1,429 stars. No commits in the last 6 months. Available on PyPI.

Use this if you need to generate high-quality, custom voice narration from text, especially if you want the generated speech to match a specific speaker's voice, emotion, style, and accent.

Not ideal if you need a simple text-to-speech solution without custom voice cloning, or if your speaker reference does not represent your target language well.

voice-generation audio-content-creation narration multilingual-audio voice-cloning

Stale 6m

Maintenance 2 / 25

Adoption 10 / 25

Maturity 25 / 25

Community 18 / 25


Stars: 1,429

Forks: 113

Language: Python

License: Apache-2.0

Related models

fluxions-ai/vui

100M parameter lightweight conversational text-to-speech model with breaths, laughter,...

OpenVoiceOS/ovos-audio-transformer-plugin-ggwave

data over sound plugin

inboxpraveen/LLM-Minutes-of-Meeting

🎤📄 An innovative tool that transforms audio or video files into text transcripts and generates...

mbzuai-oryx/LLMVoX

LLMVoX: Autoregressive Streaming Text-to-Speech Model for Any LLM

Aratako/T5Gemma-TTS

Multilingual TTS model with voice cloning and duration control, based on T5Gemma encoder-decoder LLM
