Blaizzy/mlx-audio
A text-to-speech (TTS), speech-to-text (STT) and speech-to-speech (STS) library built on Apple's MLX framework, providing efficient speech analysis on Apple Silicon.
This project helps content creators, educators, and businesses generate high-quality spoken audio from text, transcribe spoken audio into text, or even transform one voice into another. You provide written text or spoken audio, and it produces new audio or text transcripts. It's designed for anyone needing fast, efficient audio processing directly on their Apple computer.
6,227 stars. Used by 4 other packages. Actively maintained with 77 commits in the last 30 days. Available on PyPI.
Use this if you need to quickly convert text to natural-sounding speech, transcribe audio recordings, or change speech characteristics using your Apple Silicon-powered Mac.
Not ideal if you need to perform complex audio editing, music production, or if you are not using an Apple computer with an M-series chip.
Stars
6,227
Forks
486
Language
Python
License
MIT
Category
Last pushed
Mar 12, 2026
Commits (30d)
77
Dependencies
12
Reverse dependents
4
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/Blaizzy/mlx-audio"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Featured in
Recent Releases
Related tools
lenML/Speech-AI-Forge
🍦 Speech-AI-Forge is a project developed around TTS generation model, implementing an API Server...
fishaudio/fish-speech
SOTA Open Source TTS
sidharthrajaram/StyleTTS2
🐍 🤖 Pip installable package for StyleTTS 2 human-level text-to-speech and voice cloning
mlalma/kokoro-ios
Kokoro TTS for iOS and macOSX
mlalma/KokoroTestApp
Test application for Kokoro TTS model