Dicklesworthstone/franken_whisper
Agent-first Rust ASR orchestration stack: Bayesian backend routing across whisper.cpp/insanely-fast-whisper/whisper-diarization, real-time NDJSON streaming, SQLite persistence, TTY audio transport, conformance harness. 107K lines, 2000+ tests, zero unsafe code.
This tool helps developers integrate advanced speech-to-text capabilities into their applications. It takes various audio or video files as input and produces highly structured, machine-readable text transcripts with speaker identification, suitable for automated processing. Developers creating agent-based systems or data pipelines that need reliable, robust transcription will find this invaluable.
Use this if you are a developer building applications that require consistent, high-quality, and robust speech-to-text transcription with speaker diarization and need a unified way to manage different Whisper backend engines.
Not ideal if you are an end-user simply looking for a desktop app to transcribe a single audio file or if you don't have development expertise.
Stars
13
Forks
2
Language
Rust
License
—
Category
Last pushed
Mar 13, 2026
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/ml-frameworks/Dicklesworthstone/franken_whisper"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
EMUNES/Auto-Subtitle-File-Generation
Generate subtitle files with timelines in an automatic way.
zerounintezaragler/whisper_python
Whisper Python Untuk mendapatkan teks dari sebuah audio kini tidak perlu convert manual tidak...
gopiashokan/Voice-AI-Automatic-Speech-Recognition
Developed a Marathi speech-to-text application using the Hugging Face whisper ASR models....
Donny-Hikari/realtime-transcribe
Transcribe your speech or the audio playing on your computer with Whisper in realtime, and show...
papi-el/theinsyeds-whisper-analysis
Analyze OpenAI's Whisper on Mac M4 with performance benchmarks and quality assessments. Discover...