jianchang512/gemini-speech2srt

使用 Gemini AI 转写音视频为 SRT 字幕

/ 100

Emerging

This tool helps content creators, educators, and anyone working with audio or video to quickly generate accurate SRT subtitles. You input an audio or video file, and it uses AI to convert the spoken content into text, outputting a complete SRT subtitle file with precise timestamps. It's designed for individuals who need reliable subtitles without complex manual editing.

No commits in the last 6 months.

Use this if you need to convert audio or video recordings into precise SRT subtitle files for better accessibility or translation, especially when using Gemini AI's powerful transcription capabilities.

Not ideal if you prefer not to use AI-powered transcription or require extremely high-precision, frame-level timestamping for very short, specific audio events.

video-production education content-creation media-accessibility transcription

No License Stale 6m No Package No Dependents

Maintenance 0 / 25

Adoption 8 / 25

Maturity 8 / 25

Community 18 / 25

How are scores calculated?

Stars

Forks

Language

Python

License

—

Compare

gemini-speech2srt and GeminiASR

Higher-rated alternatives

mozilla-ai/document-to-podcast

Blueprint by Mozilla.ai for generating podcasts from documents using local AI

iMicknl/azure-podcast-generator

Generate an engaging podcast based on your document using Azure OpenAI and Azure Speech.

BandarLabs/gitpodcast

Convert any git repository into an engaging podcast

puntorigen/podcast_tts

A class for generating realistic audio (TTS) for podcasts and dialogues.

cxyfer/GeminiASR

A Python tool that uses Google Gemini API to transcribe video or audio files into SRT subtitle files.

Explore Voice AI Tools

All categories Trending Voice AI directory Insights