cxyfer/GeminiASR

A Python tool that uses Google Gemini API to transcribe video or audio files into SRT subtitle files.

/ 100

Emerging

This tool helps content creators, educators, and anyone working with video or audio efficiently generate subtitle files. You input video (MP4, AVI, MKV) or audio (MP3, WAV) files, and it outputs an SRT subtitle file with precise timestamps. It's designed for individuals or teams who need to quickly add accurate captions to their media.

Use this if you need to quickly and accurately transcribe video or audio content into SRT subtitle files, even for long recordings or multiple files.

Not ideal if you require human-level transcription accuracy for highly nuanced or specialized content, or if you need to generate subtitles in formats other than SRT.

video-editing content-creation e-learning media-production accessibility

No Package No Dependents

Maintenance 6 / 25

Adoption 6 / 25

Maturity 15 / 25

Community 15 / 25

How are scores calculated?

Stars

Forks

Language

Python

License

MIT

Compare

GeminiASR and gemini-speech2srt

Higher-rated alternatives

mozilla-ai/document-to-podcast

Blueprint by Mozilla.ai for generating podcasts from documents using local AI

iMicknl/azure-podcast-generator

Generate an engaging podcast based on your document using Azure OpenAI and Azure Speech.

BandarLabs/gitpodcast

Convert any git repository into an engaging podcast

puntorigen/podcast_tts

A class for generating realistic audio (TTS) for podcasts and dialogues.

amscotti/hn-podcaster

The HackerNews Podcaster is a JavaScript application that utilizes the power of OpenAI's...

Explore Voice AI Tools

All categories Trending Voice AI directory Insights