ruslantau/media-annotator

Web-based annotation tool for media data. The easiest way to create you own media dataset.

22
/ 100
Experimental

This tool helps you quickly create datasets from audio files, especially for speech analysis. You upload audio files (like WAVs or MP3s), then manually mark or automatically transcribe speech regions across 20+ languages. The output is a collection of trimmed audio clips and a CSV or JSON file detailing the marked regions. It's ideal for linguists, researchers, or data scientists building speech-to-text models.

No commits in the last 6 months.

Use this if you need to precisely annotate speech segments in audio files and create a structured dataset for research or model training.

Not ideal if you need to annotate video, images, or other non-audio media types, or if your primary need is for advanced speaker diarization or punctuation.

audio-analysis speech-recognition-data linguistic-research data-labeling sound-transcription
Stale 6m No Package No Dependents
Maintenance 0 / 25
Adoption 6 / 25
Maturity 16 / 25
Community 0 / 25

How are scores calculated?

Stars

16

Forks

Language

Vue

License

MIT

Last pushed

May 12, 2023

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/ruslantau/media-annotator"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.