hammaad2002/ASRAdversarialAttacks
An ASR (Automatic Speech Recognition) adversarial attack repository.
This helps you test how vulnerable an Automatic Speech Recognition (ASR) system is to subtle changes in audio. You provide an audio file and an ASR model, and it outputs a modified audio file designed to trick the ASR system into transcribing it incorrectly, even though a human listener might not notice the change. It's useful for researchers or developers working on speech recognition technology.
No commits in the last 6 months.
Use this if you need to evaluate the security and robustness of your ASR models against intentionally crafted audio designed to fool them.
Not ideal if you are looking to improve the general accuracy of your ASR system or simply transcribe audio without testing its vulnerabilities.
Stars
39
Forks
2
Language
Jupyter Notebook
License
—
Category
Last pushed
Nov 07, 2023
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/transformers/hammaad2002/ASRAdversarialAttacks"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
guxm2021/ALT_SpeechBrain
[ISMIR 2022] Transfer Learning of wav2vec 2.0 for Automatic Lyric Transcription
subhasis-ai/Hindi-ASR-Wav2Vec2
This repository demonstrates development of Hindi ASR model using transformers.
guxm2021/MM_ALT
[MM 2022] MM-ALT: A Multimodal Automatic Lyric Transcription System (Oral, Top paper award)
jvel07/wav2vec2_patho
Fine-tuning wav2vec2 to for Pathological Speech Processing
maximkm/DLA_ASR_HW
ASR pytorch project