Blaizzy/mlx-audio

A text-to-speech (TTS), speech-to-text (STT) and speech-to-speech (STS) library built on Apple's MLX framework, providing efficient speech analysis on Apple Silicon.

/ 100

Verified

This project helps content creators, educators, and businesses generate high-quality spoken audio from text, transcribe spoken audio into text, or transform one voice into another. You provide written text or spoken audio, and it produces new audio or a text transcript. It is designed for anyone who needs fast, efficient audio processing directly on an Apple Silicon Mac.

6,227 stars. Used by 4 other packages. Actively maintained with 77 commits in the last 30 days. Available on PyPI.

Use this if you need to quickly convert text to natural-sounding speech, transcribe audio recordings, or change speech characteristics using your Apple Silicon-powered Mac.

Not ideal if you need to perform complex audio editing, music production, or if you are not using an Apple computer with an M-series chip.
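As a quick way to check the hardware requirement above, the following is a minimal sketch using only Python's standard library. The function name `is_apple_silicon` is hypothetical and is not part of mlx-audio; it assumes that an Apple Silicon Mac reports `Darwin` as the system and `arm64` as the machine type.

```python
import platform


def is_apple_silicon(system=None, machine=None):
    """Return True when the OS and CPU architecture look like an Apple Silicon Mac.

    Hypothetical helper for illustration; not part of mlx-audio.
    """
    # Fall back to the values reported for the running interpreter.
    system = platform.system() if system is None else system
    machine = platform.machine() if machine is None else machine
    return system == "Darwin" and machine == "arm64"


if __name__ == "__main__":
    if is_apple_silicon():
        print("Apple Silicon Mac detected.")
    else:
        print("No Apple Silicon Mac detected; this library is likely not a good fit.")
```

A Python interpreter running under Rosetta may report `x86_64` even on an M-series Mac, so a negative result is a hint and not a definitive answer.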

content-creation audio-transcription voice-generation multilingual-communication digital-accessibility

Maintenance 22 / 25

Adoption 14 / 25

Maturity 25 / 25

Community 19 / 25


Stars: 6,227

Forks: 486

Language: Python

License: MIT

Featured in

Things AI Won't Tell You About Building a Voice App

Choosing a Voice AI Library in 2026: What's Actually Worth Building On

Recent Releases

v0.4.2, 30 Mar 2026

v0.4.1, 14 Mar 2026

v0.4.0, 07 Mar 2026

v0.3.1, 29 Jan 2026

v0.3.0, 25 Jan 2026

Related tools

lenML/Speech-AI-Forge

🍦 Speech-AI-Forge is a project developed around TTS generation model, implementing an API Server...

fishaudio/fish-speech

SOTA Open Source TTS

sidharthrajaram/StyleTTS2

🐍 🤖 Pip installable package for StyleTTS 2 human-level text-to-speech and voice cloning

mlalma/kokoro-ios

Kokoro TTS for iOS and macOSX

mlalma/KokoroTestApp

Test application for Kokoro TTS model
