YChenL/DS-TDNN
Official implementation of "Dual-stream Time-Delay Neural Network with Dynamic Global Filter for Speaker Verification" in PyTorch
This project helps speech technology researchers and developers build systems that can verify a speaker's identity from their voice. It takes raw or pre-processed audio recordings as input and outputs a confidence score or decision on whether the speaker matches a known identity. Researchers working on voice biometrics, security systems, or personalized voice assistants would use this.
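As a rough illustration of the "confidence score or decision" step described above, the sketch below compares two speaker embeddings with cosine similarity and applies a threshold. This is not code from the repository: the function names, the toy embedding vectors, and the threshold value of 0.5 are all assumptions made for illustration. Cosine scoring is a common back-end in speaker verification, but this listing does not say which scoring method DS-TDNN uses.

```python
import math


def cosine_score(enrolled, test):
    """Cosine similarity between two embedding vectors (hypothetical helper)."""
    if len(enrolled) != len(test):
        raise ValueError("embeddings must have the same length")
    dot = sum(x * y for x, y in zip(enrolled, test))
    norm_enrolled = math.sqrt(sum(x * x for x in enrolled))
    norm_test = math.sqrt(sum(y * y for y in test))
    if norm_enrolled == 0.0 or norm_test == 0.0:
        raise ValueError("embeddings must be non-zero")
    return dot / (norm_enrolled * norm_test)


def verify(enrolled, test, threshold=0.5):
    """Return (score, accepted). The threshold is an arbitrary placeholder;
    in practice it would be tuned on held-out trial pairs."""
    score = cosine_score(enrolled, test)
    return score, score >= threshold
```

In a real system the two vectors would come from a speaker embedding model applied to the enrollment and test recordings; here they are plain Python lists so the sketch has no dependencies.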
No commits in the last 6 months.
Use this if you are a speech technology researcher or developer looking to experiment with or implement advanced models for speaker verification.
Not ideal if you are a non-technical user seeking a ready-to-use speaker verification application or service.
Stars: 41
Forks: 8
Language: Python
License: —
Category:
Last pushed: Aug 31, 2023
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/YChenL/DS-TDNN"
Open to everyone: 100 requests/day, no key needed. A free key raises the limit to 1,000 requests/day.
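The same request can be made from Python with the standard library. The sketch below is a minimal example under stated assumptions: the path layout (category, then owner, then repository name) is inferred from the single URL shown above, the helper names are hypothetical, and the response body is returned as raw text because its format is not documented here.

```python
from urllib.parse import quote
from urllib.request import urlopen

# Base path taken from the curl example above.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def build_quality_url(category, owner, repo):
    """Build the endpoint URL. The category/owner/repo layout is an
    assumption inferred from the one example URL."""
    parts = [quote(part, safe="") for part in (category, owner, repo)]
    return BASE_URL + "/" + "/".join(parts)


def fetch_quality(category, owner, repo, timeout=10.0):
    """Fetch the quality data and return the raw response body as text.
    No parsing is attempted because the response format is not documented here."""
    url = build_quality_url(category, owner, repo)
    with urlopen(url, timeout=timeout) as response:
        return response.read().decode("utf-8")
```

Calling `fetch_quality("voice-ai", "YChenL", "DS-TDNN")` issues the same GET request as the curl command and counts against the daily request limit.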
Higher-rated alternatives
descriptinc/descript-audio-codec
State-of-the-art audio codec with 90x compression factor. Supports 44.1kHz, 24kHz, and 16kHz...
drethage/speech-denoising-wavenet
A neural network for end-to-end speech denoising
YuanGongND/ast
Code for the Interspeech 2021 paper "AST: Audio Spectrogram Transformer".
iver56/torch-audiomentations
Fast audio data augmentation in PyTorch. Inspired by audiomentations. Useful for deep learning.
lmnt-com/wavegrad
A fast, high-quality neural vocoder.