pszemraj/vid2cleantxt
Python API & command-line tool to easily transcribe speech-based video files into clean text
This tool helps anyone who needs to quickly get written content from spoken videos. It takes video files with speech, processes the audio, and outputs clean, readable text transcripts along with identified keywords. Marketers, researchers, or educators could use this to make video content searchable and easier to analyze.
220 stars. No commits in the last 6 months. Available on PyPI.
Use this if you need to transform hours of video recordings into a searchable text format, enabling quick analysis and information extraction.
Not ideal if you require real-time transcription or are working with videos that have very little spoken content.
Stars
220
Forks
29
Language
Jupyter Notebook
License
Apache-2.0
Category
Last pushed
Oct 29, 2024
Commits (30d)
0
Dependencies
25
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/pszemraj/vid2cleantxt"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Related tools
jdepoix/youtube-transcript-api
This is a python API which allows you to get the transcript/subtitles for a given YouTube video....
MatteoFasulo/Whisper-TikTok
From AI tools to TikTok video creation using FFMPEG, Microsoft Edge read aloud and OpenAI Whisper model
antor44/livestream_video
playlist4whisper manages media streams playlists for livestream_video.sh, plays media, and...
ArthurFDLR/whisper-youtube
🔉 Youtube Videos Transcription with OpenAI's Whisper
JJWRoeloffs/transcribe_align_textgrid
A small wrapper package around whisper-timestamped. Create force-aligned transcription TextGrids...