theshajha/whisper-realtime-speech-to-text-summary
Transcribe real-world speech with an API call. Based on Whisper (ASR by OpenAI): https://openai.com/blog/whisper/
This tool helps you quickly transcribe spoken conversations and identify who said what. You feed in an audio recording, and it provides a written transcript with speaker labels. It's designed for anyone who needs to make sense of spoken content, like researchers analyzing interviews or businesses reviewing meeting recordings.
No commits in the last 6 months.
Use this if you need to generate a text transcript from an audio recording and want to know which parts were spoken by different individuals.
Not ideal if you need real-time transcription during a live conversation or if you primarily need a summary of the audio content rather than a full transcript.
Stars: 8
Forks: —
Language: Python
License: —
Category:
Last pushed: Feb 23, 2023
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/theshajha/whisper-realtime-speech-to-text-summary"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
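The same request can be made from Python. The following is a minimal sketch using only the standard library. It assumes the endpoint returns a JSON body and that the three path segments after `/quality/` are a category, a repository owner, and a repository name; neither assumption is confirmed by this page. The function names `build_quality_url` and `fetch_quality` are hypothetical helpers introduced for illustration.

```python
import json
import urllib.parse
import urllib.request

# Base path copied from the curl example on this page.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def build_quality_url(category: str, owner: str, repo: str) -> str:
    """Build the endpoint URL, mirroring the pattern of the curl example.

    Assumption: the path is /<category>/<owner>/<repo>. This is inferred
    from the single example URL, not from documented API behavior.
    """
    parts = [urllib.parse.quote(p, safe="") for p in (category, owner, repo)]
    return f"{BASE_URL}/{'/'.join(parts)}"


def fetch_quality(category: str, owner: str, repo: str, timeout: float = 10.0):
    """Fetch the data for one repository and decode the response.

    Assumption: the response body is JSON. Its fields are not described
    here, so the decoded value is returned as-is for the caller to inspect.
    """
    url = build_quality_url(category, owner, repo)
    with urllib.request.urlopen(url, timeout=timeout) as response:
        return json.load(response)
```

Calling `fetch_quality("voice-ai", "theshajha", "whisper-realtime-speech-to-text-summary")` requests the same URL as the curl command. With the keyless limit of 100 requests/day, it is worth caching the result rather than calling it in a loop.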
Higher-rated alternatives
collabora/WhisperLive
A nearly-live implementation of OpenAI's Whisper.
Kieirra/murmure
Fully local, private, and cross-platform Speech-to-Text with LLM post-processing
Softcatala/whisper-ctranslate2
Whisper command-line client, based on CTranslate2, compatible with the original OpenAI client.
pavelzbornik/whisperX-FastAPI
FastAPI service on top of WhisperX
royshil/obs-localvocal
OBS plugin for local speech recognition and captioning using AI