vb000/Waveformer

A deep neural network architecture for low-latency audio processing

42
/ 100
Emerging

This tool helps audio engineers, sound designers, and researchers isolate specific sounds from a mixed audio file in real time. You input an audio recording containing multiple sounds, specify the target sound you want to extract (like "Computer keyboard" or "Bark"), and it outputs a new audio file with only the requested sound, quickly and with minimal delay.

323 stars. No commits in the last 6 months.

Use this if you need to extract individual sounds from complex audio mixtures with very low processing delay, such as for live audio applications or real-time monitoring.

Not ideal if you need a solution that doesn't require a Python development environment or if you primarily work with pre-recorded, non-streaming audio where real-time performance isn't critical.

audio-separation sound-extraction real-time-audio audio-signal-processing live-sound
Stale 6m No Package No Dependents
Maintenance 0 / 25
Adoption 10 / 25
Maturity 16 / 25
Community 16 / 25

How are scores calculated?

Stars

323

Forks

35

Language

Python

License

MIT

Last pushed

Aug 15, 2023

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/vb000/Waveformer"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.