folubebe/gemini_realtime_speech_to_text
Real-time speech translation using Google Gemini API for free
This tool helps you understand spoken language in real-time, even across different languages. You speak into your microphone, and the text appears on your screen, translated into your chosen language, and is also saved to a file. It's ideal for anyone who needs to quickly grasp conversations or lectures happening in a foreign language, like international travelers, students, or business professionals.
No commits in the last 6 months.
Use this if you need an immediate, written translation of spoken words from one language to another.
Not ideal if you require perfectly polished, human-quality translations for formal documents or publications.
Stars
10
Forks
3
Language
Python
License
—
Category
Last pushed
Mar 18, 2025
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/folubebe/gemini_realtime_speech_to_text"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
mozilla-ai/document-to-podcast
Blueprint by Mozilla.ai for generating podcasts from documents using local AI
iMicknl/azure-podcast-generator
Generate an engaging podcast based on your document using Azure OpenAI and Azure Speech.
BandarLabs/gitpodcast
Convert any git repository into an engaging podcast
puntorigen/podcast_tts
A class for generating realistic audio (TTS) for podcasts and dialogues.
cxyfer/GeminiASR
A Python tool that uses Google Gemini API to transcribe video or audio files into SRT subtitle files.