ruslantau/media-annotator

Web-based annotation tool for media data. The easiest way to create you own media dataset.

/ 100

Experimental

This tool helps you quickly create datasets from audio files, especially for speech analysis. You upload audio files (like WAVs or MP3s), then manually mark or automatically transcribe speech regions across 20+ languages. The output is a collection of trimmed audio clips and a CSV or JSON file detailing the marked regions. It's ideal for linguists, researchers, or data scientists building speech-to-text models.

No commits in the last 6 months.

Use this if you need to precisely annotate speech segments in audio files and create a structured dataset for research or model training.

Not ideal if you need to annotate video, images, or other non-audio media types, or if your primary need is for advanced speaker diarization or punctuation.

audio-analysis speech-recognition-data linguistic-research data-labeling sound-transcription

Stale 6m No Package No Dependents

Maintenance 0 / 25

Adoption 6 / 25

Maturity 16 / 25

Community 0 / 25

How are scores calculated?

Stars

Forks

—

Language

Vue

License

MIT

Higher-rated alternatives

jdepoix/youtube-transcript-api

This is a python API which allows you to get the transcript/subtitles for a given YouTube video....

MatteoFasulo/Whisper-TikTok

From AI tools to TikTok video creation using FFMPEG, Microsoft Edge read aloud and OpenAI Whisper model

pszemraj/vid2cleantxt

Python API & command-line tool to easily transcribe speech-based video files into clean text

antor44/livestream_video

playlist4whisper manages media streams playlists for livestream_video.sh, plays media, and...

ArthurFDLR/whisper-youtube

🔉 Youtube Videos Transcription with OpenAI's Whisper

Explore Voice AI Tools

All categories Trending Voice AI directory Insights