folubebe/gemini_realtime_speech_to_text

Free real-time speech translation using the Google Gemini API

/ 100

Experimental

This tool transcribes and translates speech in real time. You speak into your microphone, and the text appears on your screen translated into your chosen language; it is also saved to a file. It suits anyone who needs to follow conversations or lectures in a foreign language, such as international travelers, students, or business professionals.

No commits in the last 6 months.

Use this if you need an immediate, written translation of spoken words from one language to another.

Not ideal if you require perfectly polished, human-quality translations for formal documents or publications.

live-translation language-learning international-communication multilingual-meetings transcription

No License, Stale 6m, No Package, No Dependents

Maintenance 0 / 25

Adoption 5 / 25

Maturity 8 / 25

Community 14 / 25

Language: Python

License: —

Higher-rated alternatives

mozilla-ai/document-to-podcast

Blueprint by Mozilla.ai for generating podcasts from documents using local AI

iMicknl/azure-podcast-generator

Generate an engaging podcast based on your document using Azure OpenAI and Azure Speech.

BandarLabs/gitpodcast

Convert any git repository into an engaging podcast

puntorigen/podcast_tts

A class for generating realistic audio (TTS) for podcasts and dialogues.

cxyfer/GeminiASR

A Python tool that uses Google Gemini API to transcribe video or audio files into SRT subtitle files.
