my-north-ai/semantic_audio_filtering
Synthetic data augmentation technique via LLM for Automatic Speech Recognition fine tuning.
This project helps speech technologists and AI researchers improve the accuracy of Automatic Speech Recognition (ASR) models by generating higher-quality synthetic training data. It takes your existing audio data and text, processes it through a filtering framework, and outputs augmented synthetic audio for ASR model fine-tuning. This is for professionals working on speech AI who need to enhance model performance.
No commits in the last 6 months.
Use this if you are a speech technologist or AI researcher looking to fine-tune ASR models and need to generate semantically rich, filtered synthetic audio data to improve their accuracy.
Not ideal if you are looking for a ready-to-use ASR model for direct transcription or if you do not have existing audio data to start with.
Stars
10
Forks
—
Language
Python
License
Apache-2.0
Category
Last pushed
Oct 04, 2025
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/my-north-ai/semantic_audio_filtering"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
YuanGongND/whisper-at
Code and Pretrained Models for Interspeech 2023 Paper "Whisper-AT: Noise-Robust Automatic Speech...
adi-gov-tw/Taiwan-Tongues-ASR-CE
Taiwan Tongues ASR CE 是一個開源語音辨識(Automatic Speech Recognition, ASR)模型專案,專為台灣多元語言環境設計。 本模型支援...
huggingface/distil-whisper
Distilled variant of Whisper for speech recognition. 6x faster, 50% smaller, within 1% word error rate.
phineas-pta/fine-tune-whisper-vi
jupyter notebooks to fine tune whisper models on Vietnamese using Colab and/or Kaggle and/or AWS EC2
KevKibe/African-Whisper
🚀 Framework for seamless fine-tuning of Whisper model on a multi-lingual dataset and deployment to prod.