ruslantau/media-annotator
Web-based annotation tool for media data. The easiest way to create you own media dataset.
This tool helps you quickly create datasets from audio files, especially for speech analysis. You upload audio files (like WAVs or MP3s), then manually mark or automatically transcribe speech regions across 20+ languages. The output is a collection of trimmed audio clips and a CSV or JSON file detailing the marked regions. It's ideal for linguists, researchers, or data scientists building speech-to-text models.
No commits in the last 6 months.
Use this if you need to precisely annotate speech segments in audio files and create a structured dataset for research or model training.
Not ideal if you need to annotate video, images, or other non-audio media types, or if your primary need is for advanced speaker diarization or punctuation.
Stars
16
Forks
—
Language
Vue
License
MIT
Category
Last pushed
May 12, 2023
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/ruslantau/media-annotator"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
jdepoix/youtube-transcript-api
This is a python API which allows you to get the transcript/subtitles for a given YouTube video....
MatteoFasulo/Whisper-TikTok
From AI tools to TikTok video creation using FFMPEG, Microsoft Edge read aloud and OpenAI Whisper model
pszemraj/vid2cleantxt
Python API & command-line tool to easily transcribe speech-based video files into clean text
antor44/livestream_video
playlist4whisper manages media streams playlists for livestream_video.sh, plays media, and...
ArthurFDLR/whisper-youtube
🔉 Youtube Videos Transcription with OpenAI's Whisper