JusperLee/Dual-Path-RNN-Pytorch

Dual-path RNN: efficient long sequence modeling for time-domain single-channel speech separation implemented by Pytorch

/ 100

Emerging

This project helps separate individual voices from a single audio recording where multiple people are speaking at once. You feed in an audio file with mixed speech, and it produces separate audio files, each containing the isolated speech of one person. This is useful for researchers and developers working on speech processing applications, such as improving transcription accuracy or enhancing specific voices.

464 stars. No commits in the last 6 months.

Use this if you need to cleanly isolate individual speech signals from a single audio track containing a mixture of voices.

Not ideal if you're looking for a ready-to-use application with a graphical interface for end-users, as this requires technical setup and command-line execution.

speech-separation audio-enhancement voice-processing acoustic-research speech-transcription

Stale 6m No Package No Dependents

Maintenance 0 / 25

Adoption 10 / 25

Maturity 16 / 25

Community 20 / 25

How are scores calculated?

Stars

464

Forks

Language

Python

License

Apache-2.0

Higher-rated alternatives

pytorch/audio

Data manipulation and transformation for audio signal processing, powered by PyTorch

asteroid-team/asteroid

The PyTorch-based audio source separation toolkit for researchers

deezer/spleeter

Deezer source separation library including pretrained models.

audeering/opensmile

The Munich Open-Source Large-Scale Multimedia Feature Extractor

audeering/opensmile-python

Python package for openSMILE

Explore ML Frameworks

All categories Trending ML Framework directory Insights