rioharper/VocalForge

Your one-stop solution for voice dataset creation

/ 100

Emerging

This tool helps speech technology professionals quickly create high-quality voice datasets. You provide raw audio (like podcasts or interviews) or even YouTube playlists, and it automatically cleans the audio, identifies different speakers, transcribes the speech, and aligns the text with the audio. The output is a cleanly formatted dataset, ready for training speech models like text-to-speech or hotword detection.

130 stars. No commits in the last 6 months.

Use this if you need to rapidly turn large amounts of raw audio into structured, clean datasets for training voice AI models, especially if you deal with diverse audio sources or multiple speakers.

Not ideal if you need perfectly clean, human-verified data without any post-processing, as the automated outputs require some verification and manual refinement.

voice-AI-development speech-recognition-datasets audio-processing text-to-speech hotword-detection

Stale 6m No Package No Dependents

Maintenance 0 / 25

Adoption 10 / 25

Maturity 16 / 25

Community 19 / 25

How are scores calculated?

Stars

130

Forks

Language

Python

License

MIT

Higher-rated alternatives

voicegain/platform

Voicegain Enterprise Speech-to-Text Platform (API, Portal, etc.)

aws-samples/amazon-transcribe-live-call-analytics

Amazon Transcribe Live Call Analytics (LCA) Sample Solution

SamirPaulb/real-time-voice-translator

A desktop application that uses AI to translate voice between languages in real time, while...

davidamacey/OpenTranscribe

Self-hosted AI-powered transcription platform with speaker diarization, search, and...

jim-schwoebel/voicebook

🗣️ A book and repo to get you started programming voice computing applications in Python (10...

Explore Voice AI Tools

All categories Trending Voice AI directory Insights