YChenL/DS-TDNN

Official implement of "Dual-stream Time-Delay Neural Network with Dynamic Global Filter for Speaker Verification" in PyTorch

/ 100

Emerging

This project helps speech technology researchers and developers build systems that can verify a speaker's identity from their voice. It takes raw or pre-processed audio recordings as input and outputs a confidence score or decision on whether the speaker matches a known identity. Researchers working on voice biometrics, security systems, or personalized voice assistants would use this.

No commits in the last 6 months.

Use this if you are a speech technology researcher or developer looking to experiment with or implement advanced models for speaker verification.

Not ideal if you are a non-technical user seeking a ready-to-use speaker verification application or service.

speaker-verification voice-biometrics speech-technology audio-security voice-recognition-research

No License Stale 6m No Package No Dependents

Maintenance 0 / 25

Adoption 7 / 25

Maturity 8 / 25

Community 17 / 25

How are scores calculated?

Stars

Forks

Language

Python

License

—

Higher-rated alternatives

descriptinc/descript-audio-codec

State-of-the-art audio codec with 90x compression factor. Supports 44.1kHz, 24kHz, and 16kHz...

drethage/speech-denoising-wavenet

A neural network for end-to-end speech denoising

YuanGongND/ast

Code for the Interspeech 2021 paper "AST: Audio Spectrogram Transformer".

iver56/torch-audiomentations

Fast audio data augmentation in PyTorch. Inspired by audiomentations. Useful for deep learning.

lmnt-com/wavegrad

A fast, high-quality neural vocoder.

Explore Voice AI Tools

All categories Trending Voice AI directory Insights