lucasnewman/f5-tts-mlx
Implementation of F5-TTS in MLX
This tool helps you quickly transform written text into natural-sounding speech. You input plain text, and it generates an audio file of that text being spoken. It can even mimic a specific voice from an audio sample you provide. This is ideal for content creators, podcasters, educators, or anyone needing to convert written content into audio.
611 stars. Used by 2 other packages. No commits in the last 6 months. Available on PyPI.
Use this if you need to generate high-quality, natural-sounding speech from text quickly, potentially matching a specific voice, without needing extensive setup.
Not ideal if you require fine-grained control over speech nuances like emotional expression, specific dialects, or complex multi-speaker dialogues, which might require more specialized audio production tools.
Stars
611
Forks
61
Language
Python
License
MIT
Category
Last pushed
Mar 19, 2025
Commits (30d)
0
Dependencies
12
Reverse dependents
2
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/lucasnewman/f5-tts-mlx"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Compare
Related tools
index-tts/index-tts
An Industrial-Level Controllable and Efficient Zero-Shot Text-To-Speech System
stepfun-ai/Step-Audio-EditX
A powerful 3B-parameter, LLM-based Reinforcement Learning audio edit model excels at editing...
unilight/seq2seq-vc
A sequence-to-sequence voice conversion toolkit.
FireRedTeam/FireRedTTS
An Open-Sourced LLM-empowered Foundation TTS System
RaduBolbo/F5-TTS-Emotional-CFG
Zero-shot voice cloning text-to-speech (TTS) with explicit emotion class conditioning built on F5-TTS