rioharper/VocalForge

Your one-stop solution for voice dataset creation

45
/ 100
Emerging

This tool helps speech technology professionals quickly create high-quality voice datasets. You provide raw audio (like podcasts or interviews) or even YouTube playlists, and it automatically cleans the audio, identifies different speakers, transcribes the speech, and aligns the text with the audio. The output is a cleanly formatted dataset, ready for training speech models like text-to-speech or hotword detection.

130 stars. No commits in the last 6 months.

Use this if you need to rapidly turn large amounts of raw audio into structured, clean datasets for training voice AI models, especially if you deal with diverse audio sources or multiple speakers.

Not ideal if you need perfectly clean, human-verified data without any post-processing, as the automated outputs require some verification and manual refinement.

voice-AI-development speech-recognition-datasets audio-processing text-to-speech hotword-detection
Stale 6m No Package No Dependents
Maintenance 0 / 25
Adoption 10 / 25
Maturity 16 / 25
Community 19 / 25

How are scores calculated?

Stars

130

Forks

24

Language

Python

License

MIT

Last pushed

Dec 10, 2023

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/rioharper/VocalForge"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.