rioharper/VocalForge
Your one-stop solution for voice dataset creation
This tool helps speech technology professionals quickly create high-quality voice datasets. You provide raw audio (like podcasts or interviews) or even YouTube playlists, and it automatically cleans the audio, identifies different speakers, transcribes the speech, and aligns the text with the audio. The output is a cleanly formatted dataset, ready for training speech models like text-to-speech or hotword detection.
130 stars. No commits in the last 6 months.
Use this if you need to rapidly turn large amounts of raw audio into structured, clean datasets for training voice AI models, especially if you deal with diverse audio sources or multiple speakers.
Not ideal if you need perfectly clean, human-verified data without any post-processing, as the automated outputs require some verification and manual refinement.
Stars: 130
Forks: 24
Language: Python
License: MIT
Category:
Last pushed: Dec 10, 2023
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/rioharper/VocalForge"
Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000 requests/day.
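The same request can be made from Python. The following is a minimal sketch using only the standard library. It assumes the endpoint returns a JSON body, which the listing does not confirm, and the helper names `build_quality_url` and `fetch_quality` are hypothetical, introduced here for illustration only. No API key handling is shown because the listing does not say how a key is passed.

```python
import json
import urllib.request
from urllib.parse import quote

# Base path taken from the curl example above.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def build_quality_url(category: str, owner: str, repo: str) -> str:
    """Build the endpoint URL for one repository (hypothetical helper)."""
    # Percent-encode each path segment so unusual names stay valid in a URL.
    parts = (quote(part, safe="") for part in (category, owner, repo))
    return BASE_URL + "/" + "/".join(parts)


def fetch_quality(category: str, owner: str, repo: str, timeout: float = 10.0):
    """Fetch the listing data for one repository.

    Assumption: the response body is JSON. If it is not, json.load raises
    a ValueError (json.JSONDecodeError).
    """
    url = build_quality_url(category, owner, repo)
    with urllib.request.urlopen(url, timeout=timeout) as response:
        return json.load(response)
```

Calling `fetch_quality("voice-ai", "rioharper", "VocalForge")` issues the same GET request as the curl command. With the keyless limit of 100 requests/day, it may be worth caching the result locally rather than calling the endpoint repeatedly.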
Higher-rated alternatives
voicegain/platform
Voicegain Enterprise Speech-to-Text Platform (API, Portal, etc.)
aws-samples/amazon-transcribe-live-call-analytics
Amazon Transcribe Live Call Analytics (LCA) Sample Solution
SamirPaulb/real-time-voice-translator
A desktop application that uses AI to translate voice between languages in real time, while...
davidamacey/OpenTranscribe
Self-hosted AI-powered transcription platform with speaker diarization, search, and...
jim-schwoebel/voicebook
🗣️ A book and repo to get you started programming voice computing applications in Python (10...