happytunesai/EZ-STT-Logger-GUI
Python GUI for real-time Speech-to-Text (STT) using local Whisper, OpenAI API, or ElevenLabs API. Features audio logging, filtering, replacements, WebSocket control (Stream Deck), and Streamer.bot integration.
This application helps content creators, streamers, and anyone needing live text from spoken words. It takes audio input from your microphone and instantly converts it into text, which can be logged, filtered, and used in other applications. The ideal user is a streamer or content creator who wants to display live captions or integrate speech-to-text with tools like Streamer.bot or PNGTuber-GPT.
No commits in the last 6 months.
Use this if you need real-time transcription of spoken audio for live streaming, content creation, or accessible captioning, and want flexible options for transcription engines (local or cloud-based) and integration with streaming tools.
Not ideal if you need to transcribe pre-recorded audio files offline without real-time interaction, or if your primary need is highly accurate, non-conversational transcription for professional legal or medical documentation.
Stars
7
Forks
—
Language
Python
License
MIT
Category
Last pushed
May 11, 2025
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/happytunesai/EZ-STT-Logger-GUI"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
whitphx/streamlit-stt-app
Real time web based Speech-to-Text app with Streamlit
open-mmlab/Amphion
Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to...
saidsef/tika-document-to-text
Apache Tika extract text and metadata from any document format with this pre-built containerised...
declare-lab/jamify
JAM: A Tiny Flow-based Song Generator with Fine-grained Controllability and Aesthetic Alignment
SiddhantSadangi/st_deepgram_playground
API playground for Deepgram built with Streamlit