mravanelli/SincNet

SincNet is a neural architecture for efficiently processing raw audio samples.

/ 100

Established

SincNet helps with identifying who is speaking in an audio recording by analyzing raw audio waveforms. You provide sound files, and it processes them to create a customized filter bank that specifically tunes into the unique characteristics of each speaker's voice. This is ideal for researchers or engineers working on voice authentication or personalizing voice interfaces.

1,235 stars. No commits in the last 6 months.

Use this if you need to build a system that can accurately identify individual speakers from raw audio recordings.

Not ideal if your primary goal is general speech-to-text transcription, as this tool is specifically designed for speaker identification.

speaker-identification voice-biometrics audio-analysis speech-technology voice-authentication

Stale 6m No Package No Dependents

Maintenance 0 / 25

Adoption 10 / 25

Maturity 16 / 25

Community 25 / 25

How are scores calculated?

Stars

1,235

Forks

270

Language

Python

License

MIT

Related frameworks

iver56/audiomentations

A Python library for audio data augmentation. Useful for making audio ML models work well in the...

Rikorose/DeepFilterNet

Noise supression using deep filtering

torchsynth/torchsynth

A GPU-optional modular synthesizer in pytorch, 16200x faster than realtime, for audio ML researchers.

marl/openl3

OpenL3: Open-source deep audio and image embeddings

archinetai/audio-data-pytorch

A collection of useful audio datasets and transforms for PyTorch.

Explore ML Frameworks

All categories Trending ML Framework directory Insights