Frikallo/parakeet.cpp

Ultra fast and portable Parakeet implementation for on-device inference in C++ using Axiom with MPS+Unified Memory

41
/ 100
Emerging

This project offers extremely fast and efficient speech recognition directly on devices, particularly Apple Silicon. It takes audio files in various formats (like WAV, MP3, FLAC) and converts them into text, including word-by-word timestamps and speaker identification. This tool is ideal for developers building applications that need real-time or high-volume audio transcription without relying on cloud services or large, complex runtimes.

244 stars.

Use this if you are a developer creating applications that require ultra-fast, on-device audio transcription, speaker diarization, or streaming speech-to-text with minimal latency, especially on Apple Silicon hardware.

Not ideal if you are looking for a ready-to-use application with a graphical user interface or if your primary development environment is not C++.

audio-transcription speech-recognition speaker-diarization on-device-AI edge-computing
No Package No Dependents
Maintenance 13 / 25
Adoption 10 / 25
Maturity 11 / 25
Community 7 / 25

How are scores calculated?

Stars

244

Forks

7

Language

C++

License

MIT

Last pushed

Mar 15, 2026

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/Frikallo/parakeet.cpp"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.