gongouveia/Whisper-Synthetic-ASR-Dataset-Generator

This UI serves as a Synthetic ASR Dataset Generator powered by/for OpenAI Whisper, enabling users to capture audio, transcribing it, on the fly and manage the generated dataset 🤗. Fine tune Whisper or enhanced and custom datasets

21
/ 100
Experimental

This tool helps AI researchers and machine learning engineers create high-quality audio datasets for training speech recognition models. You can record audio through a simple interface, transcribe it instantly, and manage the resulting audio-text pairs. It's designed for anyone building or fine-tuning Automatic Speech Recognition (ASR) systems.

No commits in the last 6 months.

Use this if you need to generate custom audio and text pairs to improve the accuracy of speech recognition models, especially for unique accents, jargon, or specific acoustic environments.

Not ideal if you're looking for a simple audio recorder for personal notes or if you need to work with existing MP3 files directly without conversion.

ASR-dataset-generation speech-recognition-training NLP-data-creation machine-learning-engineering audio-data-management
No License Stale 6m No Package No Dependents
Maintenance 0 / 25
Adoption 7 / 25
Maturity 8 / 25
Community 6 / 25

How are scores calculated?

Stars

32

Forks

2

Language

Python

License

Last pushed

Nov 26, 2024

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/gongouveia/Whisper-Synthetic-ASR-Dataset-Generator"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.