rafaelvalle/asrgen

Attacking Speaker Recognition with Deep Generative Models

/ 100

Emerging

This project helps security researchers and voice biometric developers explore vulnerabilities in speaker recognition systems. It takes existing audio data and uses deep generative models to create synthetic audio samples that can deceive these systems. The output is 'fake' audio that sounds like a target speaker, useful for evaluating the robustness of voice authentication.

No commits in the last 6 months.

Use this if you need to generate adversarial audio samples to test the security and reliability of speaker recognition or voice biometric systems.

Not ideal if you are looking to build a new speaker recognition system or for general audio synthesis unrelated to security testing.

voice-biometrics speaker-recognition security-research adversarial-audio voice-authentication

No License Stale 6m No Package No Dependents

Maintenance 0 / 25

Adoption 7 / 25

Maturity 8 / 25

Community 15 / 25

How are scores calculated?

Stars

Forks

Language

Jupyter Notebook

License

—

Higher-rated alternatives

kan-bayashi/ParallelWaveGAN

Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN & HiFi-GAN & StyleMelGAN) with Pytorch

fatchord/WaveRNN

WaveRNN Vocoder + TTS

shangeth/wavencoder

WavEncoder is a Python library for encoding audio signals, transforms for audio augmentation,...

rishikksh20/iSTFTNet-pytorch

iSTFTNet : Fast and Lightweight Mel-spectrogram Vocoder Incorporating Inverse Short-time Fourier...

seungwonpark/melgan

MelGAN vocoder (compatible with NVIDIA/tacotron2)

Explore Voice AI Tools

All categories Trending Voice AI directory Insights