gongouveia/Whisper-Synthetic-ASR-Dataset-Generator

This UI is a synthetic ASR dataset generator powered by and built for OpenAI Whisper. It lets users capture audio, transcribe it on the fly, and manage the generated dataset 🤗. Fine-tune Whisper on enhanced and custom datasets.

Experimental

This tool helps AI researchers and machine learning engineers create high-quality audio datasets for training speech recognition models. You can record audio through a simple interface, transcribe it instantly, and manage the resulting audio-text pairs. It's designed for anyone building or fine-tuning Automatic Speech Recognition (ASR) systems.
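The audio-text pairs described above can be stored in many ways. Below is a minimal sketch of one of them, not the repository's actual code: each clip is saved as a 16 kHz mono WAV file and its transcript is appended to a `metadata.csv` file. The helper names `add_pair` and `load_pairs`, the column names, and the directory layout are all assumptions chosen for illustration. The transcript is passed in as a plain string, so the transcription step itself (for example, a Whisper call) is left out.

```python
import csv
import wave
from pathlib import Path


def add_pair(dataset_dir, clip_name, pcm_bytes, transcript, sample_rate=16000):
    """Save one mono 16-bit PCM clip and append its transcript to metadata.csv.

    Hypothetical helper: the layout (WAV files plus a metadata.csv with
    file_name and transcription columns) is an assumption, not the tool's format.
    """
    root = Path(dataset_dir)
    root.mkdir(parents=True, exist_ok=True)

    # Write the raw PCM samples as a WAV file.
    with wave.open(str(root / clip_name), "wb") as wav:
        wav.setnchannels(1)           # mono
        wav.setsampwidth(2)           # 2 bytes per sample = 16-bit audio
        wav.setframerate(sample_rate)
        wav.writeframes(pcm_bytes)

    # Append the (file_name, transcription) row, writing the header only once.
    meta = root / "metadata.csv"
    is_new = not meta.exists()
    with meta.open("a", newline="", encoding="utf-8") as f:
        writer = csv.writer(f)
        if is_new:
            writer.writerow(["file_name", "transcription"])
        writer.writerow([clip_name, transcript])
    return meta


def load_pairs(dataset_dir):
    """Read every audio-text pair back as a list of dicts."""
    meta = Path(dataset_dir) / "metadata.csv"
    with meta.open(newline="", encoding="utf-8") as f:
        return list(csv.DictReader(f))
```

Calling `add_pair("my_dataset", "clip_0001.wav", pcm_bytes, "hello world")` once per recording builds up the dataset incrementally, and `load_pairs("my_dataset")` returns the rows for review or export.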

No commits in the last 6 months.

Use this if you need to generate custom audio and text pairs to improve the accuracy of speech recognition models, especially for unique accents, jargon, or specific acoustic environments.

Not ideal if you're looking for a simple audio recorder for personal notes or if you need to work with existing MP3 files directly without conversion.

Tags: ASR-dataset-generation, speech-recognition-training, NLP-data-creation, machine-learning-engineering, audio-data-management

No License, Stale 6m, No Package, No Dependents

Maintenance 0 / 25

Adoption 7 / 25

Maturity 8 / 25

Community 6 / 25

Language: Python

License: none

Higher-rated alternatives

collabora/WhisperLive

A nearly-live implementation of OpenAI's Whisper.

Kieirra/murmure

Fully local, private and cross platform Speech-to-Text with LLM Post-processing

Softcatala/whisper-ctranslate2

Whisper command line client compatible with original OpenAI client based on CTranslate2.

pavelzbornik/whisperX-FastAPI

FastAPI service on top of WhisperX

royshil/obs-localvocal

OBS plugin for local speech recognition and captioning using AI
