cxyfer/GeminiASR
A Python tool that uses the Google Gemini API to transcribe video or audio files into SRT subtitle files.
This tool helps content creators, educators, and anyone else working with video or audio generate subtitle files efficiently. It takes video (MP4, AVI, MKV) or audio (MP3, WAV) files as input and outputs an SRT subtitle file with precise timestamps. It is designed for individuals or teams who need to add accurate captions to their media quickly.
Use this if you need to quickly and accurately transcribe video or audio content into SRT subtitle files, even for long recordings or multiple files.
Not ideal if you require human-level transcription accuracy for highly nuanced or specialized content, or if you need to generate subtitles in formats other than SRT.
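For readers unfamiliar with the output format, the following is a minimal sketch of how timestamped transcript segments map onto SRT cues. It is not taken from GeminiASR's source; the function names `format_timestamp` and `to_srt` and the `(start, end, text)` segment shape are illustrative assumptions.

```python
from typing import Iterable, Tuple

# Hypothetical segment shape: (start_seconds, end_seconds, text).
Segment = Tuple[float, float, str]


def format_timestamp(seconds: float) -> str:
    """Render seconds as an SRT timestamp: HH:MM:SS,mmm (comma before ms)."""
    total_ms = round(seconds * 1000)
    hours, rem = divmod(total_ms, 3_600_000)
    minutes, rem = divmod(rem, 60_000)
    secs, ms = divmod(rem, 1000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d},{ms:03d}"


def to_srt(segments: Iterable[Segment]) -> str:
    """Build SRT text: numbered cues separated by a blank line."""
    blocks = []
    for index, (start, end, text) in enumerate(segments, start=1):
        blocks.append(
            f"{index}\n"
            f"{format_timestamp(start)} --> {format_timestamp(end)}\n"
            f"{text}\n"
        )
    return "\n".join(blocks)
```

Each cue consists of a sequence number, a `start --> end` line, and the caption text. Joining the blocks with a newline produces the blank line that separates cues.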
Stars: 17
Forks: 5
Language: Python
License: MIT
Category:
Last pushed: Jan 02, 2026
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/cxyfer/GeminiASR"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
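The same request can be made from Python. The sketch below uses only the standard library and assumes the endpoint returns a JSON body; the response schema is not documented here, so the result is returned as-is. The helper names `build_url` and `fetch_quality` are hypothetical, and reading the path as category/owner/repo is an inference from the URL above.

```python
import json
from urllib.parse import quote
from urllib.request import urlopen

# Base path copied from the curl example above.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def build_url(category: str, owner: str, repo: str) -> str:
    """Assemble the endpoint URL, percent-encoding each path segment."""
    parts = (quote(part, safe="") for part in (category, owner, repo))
    return f"{BASE_URL}/" + "/".join(parts)


def fetch_quality(category: str, owner: str, repo: str, timeout: float = 10.0):
    """Fetch the record and decode it, assuming (unverified) a JSON response."""
    with urlopen(build_url(category, owner, repo), timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))
```

Calling `fetch_quality("voice-ai", "cxyfer", "GeminiASR")` issues the same GET request as the curl command. Because unauthenticated access is limited to 100 requests per day, caching the result locally is worth considering.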
Compare
Higher-rated alternatives
mozilla-ai/document-to-podcast
Blueprint by Mozilla.ai for generating podcasts from documents using local AI
iMicknl/azure-podcast-generator
Generate an engaging podcast based on your document using Azure OpenAI and Azure Speech.
BandarLabs/gitpodcast
Convert any git repository into an engaging podcast
puntorigen/podcast_tts
A class for generating realistic audio (TTS) for podcasts and dialogues.
amscotti/hn-podcaster
The HackerNews Podcaster is a JavaScript application that utilizes the power of OpenAI's...