my-north-ai/semantic_audio_filtering

Synthetic data augmentation technique via LLM for Automatic Speech Recognition fine tuning.

/ 100

Experimental

This project helps speech technologists and AI researchers improve the accuracy of Automatic Speech Recognition (ASR) models by generating higher-quality synthetic training data. It takes your existing audio data and text, processes it through a filtering framework, and outputs augmented synthetic audio for ASR model fine-tuning. This is for professionals working on speech AI who need to enhance model performance.

No commits in the last 6 months.

Use this if you are a speech technologist or AI researcher looking to fine-tune ASR models and need to generate semantically rich, filtered synthetic audio data to improve their accuracy.

Not ideal if you are looking for a ready-to-use ASR model for direct transcription or if you do not have existing audio data to start with.

speech-recognition AI-research natural-language-processing data-augmentation model-training

Stale 6m No Package No Dependents

Maintenance 2 / 25

Adoption 5 / 25

Maturity 16 / 25

Community 0 / 25

How are scores calculated?

Stars

Forks

—

Language

Python

License

Apache-2.0

Higher-rated alternatives

YuanGongND/whisper-at

Code and Pretrained Models for Interspeech 2023 Paper "Whisper-AT: Noise-Robust Automatic Speech...

adi-gov-tw/Taiwan-Tongues-ASR-CE

Taiwan Tongues ASR CE 是一個開源語音辨識（Automatic Speech Recognition, ASR）模型專案，專為台灣多元語言環境設計。本模型支援...

huggingface/distil-whisper

Distilled variant of Whisper for speech recognition. 6x faster, 50% smaller, within 1% word error rate.

phineas-pta/fine-tune-whisper-vi

jupyter notebooks to fine tune whisper models on Vietnamese using Colab and/or Kaggle and/or AWS EC2

KevKibe/African-Whisper

🚀 Framework for seamless fine-tuning of Whisper model on a multi-lingual dataset and deployment to prod.

Explore Voice AI Tools

All categories Trending Voice AI directory Insights