mravanelli/SincNet

SincNet is a neural architecture for efficiently processing raw audio samples.

51
/ 100
Established

SincNet helps with identifying who is speaking in an audio recording by analyzing raw audio waveforms. You provide sound files, and it processes them to create a customized filter bank that specifically tunes into the unique characteristics of each speaker's voice. This is ideal for researchers or engineers working on voice authentication or personalizing voice interfaces.

1,235 stars. No commits in the last 6 months.

Use this if you need to build a system that can accurately identify individual speakers from raw audio recordings.

Not ideal if your primary goal is general speech-to-text transcription, as this tool is specifically designed for speaker identification.

speaker-identification voice-biometrics audio-analysis speech-technology voice-authentication
Stale 6m No Package No Dependents
Maintenance 0 / 25
Adoption 10 / 25
Maturity 16 / 25
Community 25 / 25

How are scores calculated?

Stars

1,235

Forks

270

Language

Python

License

MIT

Last pushed

Apr 28, 2021

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/ml-frameworks/mravanelli/SincNet"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.