diggerdu/pytorch_audio

audio processing module for pytorch:stft, istft

/ 100

Emerging

This module helps you analyze and recreate audio signals by converting them between their sound wave form and their frequency components. It takes an audio signal as input and can output its frequency spectrum, or take frequency information and output a reconstructed audio signal. Audio engineers, researchers working with sound, or machine learning practitioners building audio applications would find this useful.

No commits in the last 6 months.

Use this if you need to break down audio into its constituent frequencies for analysis or to reconstruct audio from frequency data within a PyTorch workflow.

Not ideal if you need a full-featured audio editing suite or advanced signal processing capabilities beyond core time-frequency transformations.

audio-analysis sound-engineering signal-processing acoustic-research audio-machine-learning

No License Stale 6m No Package No Dependents

Maintenance 0 / 25

Adoption 7 / 25

Maturity 8 / 25

Community 16 / 25

How are scores calculated?

Stars

Forks

Language

Python

License

—

Higher-rated alternatives

kan-bayashi/ParallelWaveGAN

Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN & HiFi-GAN & StyleMelGAN) with Pytorch

fatchord/WaveRNN

WaveRNN Vocoder + TTS

shangeth/wavencoder

WavEncoder is a Python library for encoding audio signals, transforms for audio augmentation,...

rishikksh20/iSTFTNet-pytorch

iSTFTNet : Fast and Lightweight Mel-spectrogram Vocoder Incorporating Inverse Short-time Fourier...

seungwonpark/melgan

MelGAN vocoder (compatible with NVIDIA/tacotron2)

Explore Voice AI Tools

All categories Trending Voice AI directory Insights