gongouveia/Whisper-Synthetic-ASR-Dataset-Generator
This UI serves as a Synthetic ASR Dataset Generator powered by/for OpenAI Whisper, enabling users to capture audio, transcribing it, on the fly and manage the generated dataset 🤗. Fine tune Whisper or enhanced and custom datasets
This tool helps AI researchers and machine learning engineers create high-quality audio datasets for training speech recognition models. You can record audio through a simple interface, transcribe it instantly, and manage the resulting audio-text pairs. It's designed for anyone building or fine-tuning Automatic Speech Recognition (ASR) systems.
No commits in the last 6 months.
Use this if you need to generate custom audio and text pairs to improve the accuracy of speech recognition models, especially for unique accents, jargon, or specific acoustic environments.
Not ideal if you're looking for a simple audio recorder for personal notes or if you need to work with existing MP3 files directly without conversion.
Stars
32
Forks
2
Language
Python
License
—
Category
Last pushed
Nov 26, 2024
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/gongouveia/Whisper-Synthetic-ASR-Dataset-Generator"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
collabora/WhisperLive
A nearly-live implementation of OpenAI's Whisper.
Kieirra/murmure
Fully local, private and cross platform Speech-to-Text with LLM Post-processing
Softcatala/whisper-ctranslate2
Whisper command line client compatible with original OpenAI client based on CTranslate2.
pavelzbornik/whisperX-FastAPI
FastAPI service on top of WhisperX
royshil/obs-localvocal
OBS plugin for local speech recognition and captioning using AI