innovatorved/whisper.api

This project provides an API with user-level access support for transcribing speech to text using a fine-tuned and processed Whisper ASR model.


Established

This is a self-hosted API for converting spoken audio into written text. You feed it audio files or live audio streams, and it produces a transcript in formats like JSON, SRT, or VTT. It's designed for developers and technical teams who need to integrate high-performance speech-to-text capabilities directly into their applications or workflows, while keeping full control over their data.
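To make the subtitle output concrete, here is a minimal sketch of how timed transcript segments could be rendered as SRT cues. The `(start, end, text)` segment shape and both helper names are hypothetical illustrations, not taken from whisper.api.

```python
def srt_timestamp(seconds: float) -> str:
    """Format seconds as HH:MM:SS,mmm (SRT puts a comma before the milliseconds)."""
    total_ms = round(seconds * 1000)
    hours, rem = divmod(total_ms, 3_600_000)
    minutes, rem = divmod(rem, 60_000)
    secs, ms = divmod(rem, 1000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d},{ms:03d}"


def segments_to_srt(segments):
    """Render hypothetical (start, end, text) tuples as numbered SRT cues."""
    cues = []
    for index, (start, end, text) in enumerate(segments, start=1):
        cues.append(
            f"{index}\n"
            f"{srt_timestamp(start)} --> {srt_timestamp(end)}\n"
            f"{text.strip()}\n"
        )
    # A blank line separates consecutive cues.
    return "\n".join(cues)


if __name__ == "__main__":
    print(segments_to_srt([(0.0, 1.25, "Hello"), (1.25, 2.0, "world")]))
```

VTT output would look similar, except that the file starts with a `WEBVTT` header line and timestamps use a period instead of a comma before the milliseconds.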

914 stars. Actively maintained with 22 commits in the last 30 days.

Use this if you are a developer building an application that needs to accurately transcribe audio or live speech to text and you require data ownership and a Deepgram-compatible API.

Not ideal if you are an end-user looking for a ready-to-use application with a graphical interface for transcribing audio.
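As a rough illustration of what "Deepgram-compatible" could mean for a client, the sketch below follows the request and response shape of Deepgram's pre-recorded audio endpoint (`POST /v1/listen` with a `Token` authorization header). That a self-hosted whisper.api instance accepts exactly this path and header is an assumption, as are the base URL and the function names.

```python
import json
import urllib.request

# Assumption: a locally hosted instance; replace with your own deployment URL.
BASE_URL = "http://localhost:8000"


def build_listen_request(audio_bytes, api_key, base_url=BASE_URL,
                         content_type="audio/wav"):
    """Build a Deepgram-style transcription request without sending it."""
    return urllib.request.Request(
        url=f"{base_url}/v1/listen",  # path assumed from Deepgram's API
        data=audio_bytes,
        method="POST",
        headers={
            "Authorization": f"Token {api_key}",
            "Content-Type": content_type,
        },
    )


def extract_transcript(response_body: dict) -> str:
    """Pull the first transcript out of a Deepgram-shaped JSON response."""
    return response_body["results"]["channels"][0]["alternatives"][0]["transcript"]


def transcribe(audio_bytes, api_key):
    """Send the audio and return the transcript (requires a running server)."""
    request = build_listen_request(audio_bytes, api_key)
    with urllib.request.urlopen(request) as response:
        return extract_transcript(json.load(response))
```

Keeping request construction and response parsing in separate functions makes both easy to check without a live server; only `transcribe` touches the network.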

Tags: API-development, speech-to-text, audio-transcription, data-privacy, application-integration

No package. No dependents.

Maintenance 16 / 25

Adoption 10 / 25

Maturity 16 / 25

Community 13 / 25


Stars: 914

Forks:

Language: Python

License: MIT

Compare

whisper.api and whisperX-FastAPI
whisper.api and whisper-asr-webservice
whisper.api and whisper-clip
whisper.api and whisper-speech-to-text
whisper.api and whisper-writer

Related tools

collabora/WhisperLive

A nearly-live implementation of OpenAI's Whisper.

Kieirra/murmure

Fully local, private and cross platform Speech-to-Text with LLM Post-processing

Softcatala/whisper-ctranslate2

Whisper command line client compatible with original OpenAI client based on CTranslate2.

pavelzbornik/whisperX-FastAPI

FastAPI service on top of WhisperX

royshil/obs-localvocal

OBS plugin for local speech recognition and captioning using AI
