Blaizzy/mlx-audio

A text-to-speech (TTS), speech-to-text (STT) and speech-to-speech (STS) library built on Apple's MLX framework, providing efficient speech analysis on Apple Silicon.

80
/ 100
Verified

This project helps content creators, educators, and businesses generate high-quality spoken audio from text, transcribe spoken audio into text, or even transform one voice into another. You provide written text or spoken audio, and it produces new audio or text transcripts. It's designed for anyone needing fast, efficient audio processing directly on their Apple computer.

6,227 stars. Used by 4 other packages. Actively maintained with 77 commits in the last 30 days. Available on PyPI.

Use this if you need to quickly convert text to natural-sounding speech, transcribe audio recordings, or change speech characteristics using your Apple Silicon-powered Mac.

Not ideal if you need to perform complex audio editing, music production, or if you are not using an Apple computer with an M-series chip.

content-creation audio-transcription voice-generation multilingual-communication digital-accessibility
Maintenance 22 / 25
Adoption 14 / 25
Maturity 25 / 25
Community 19 / 25

How are scores calculated?

Stars

6,227

Forks

486

Language

Python

License

MIT

Last pushed

Mar 12, 2026

Commits (30d)

77

Dependencies

12

Reverse dependents

4

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/Blaizzy/mlx-audio"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.