innovatorved/whisper.api
This project provides an API with user level access support to transcribe speech to text using a finetuned and processed Whisper ASR model.
This is a self-hosted API for converting spoken audio into written text. You feed it audio files or live audio streams, and it produces a transcript in formats like JSON, SRT, or VTT. It's designed for developers and technical teams who need to integrate high-performance speech-to-text capabilities directly into their applications or workflows, while keeping full control over their data.
914 stars. Actively maintained with 22 commits in the last 30 days.
Use this if you are a developer building an application that needs to accurately transcribe audio or live speech to text and you require data ownership and a Deepgram-compatible API.
Not ideal if you are an end-user looking for a ready-to-use application with a graphical interface for transcribing audio.
Stars
914
Forks
38
Language
Python
License
MIT
Category
Last pushed
Jan 02, 2026
Commits (30d)
22
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/innovatorved/whisper.api"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Compare
Related tools
collabora/WhisperLive
A nearly-live implementation of OpenAI's Whisper.
Kieirra/murmure
Fully local, private and cross platform Speech-to-Text with LLM Post-processing
Softcatala/whisper-ctranslate2
Whisper command line client compatible with original OpenAI client based on CTranslate2.
pavelzbornik/whisperX-FastAPI
FastAPI service on top of WhisperX
royshil/obs-localvocal
OBS plugin for local speech recognition and captioning using AI