jianchang512/gemini-speech2srt
使用 Gemini AI 转写音视频为 SRT 字幕
This tool helps content creators, educators, and anyone working with audio or video to quickly generate accurate SRT subtitles. You input an audio or video file, and it uses AI to convert the spoken content into text, outputting a complete SRT subtitle file with precise timestamps. It's designed for individuals who need reliable subtitles without complex manual editing.
No commits in the last 6 months.
Use this if you need to convert audio or video recordings into precise SRT subtitle files for better accessibility or translation, especially when using Gemini AI's powerful transcription capabilities.
Not ideal if you prefer not to use AI-powered transcription or require extremely high-precision, frame-level timestamping for very short, specific audio events.
Stars
54
Forks
13
Language
Python
License
—
Category
Last pushed
Jan 11, 2025
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/jianchang512/gemini-speech2srt"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Compare
Higher-rated alternatives
mozilla-ai/document-to-podcast
Blueprint by Mozilla.ai for generating podcasts from documents using local AI
iMicknl/azure-podcast-generator
Generate an engaging podcast based on your document using Azure OpenAI and Azure Speech.
BandarLabs/gitpodcast
Convert any git repository into an engaging podcast
puntorigen/podcast_tts
A class for generating realistic audio (TTS) for podcasts and dialogues.
cxyfer/GeminiASR
A Python tool that uses Google Gemini API to transcribe video or audio files into SRT subtitle files.