lucadellalib/audiocodecs

A collection of audio codecs with a standardized API

/ 100

Emerging

This is a collection of audio compression methods (codecs) for developers working with audio signals. It converts raw audio into a compressed, tokenized representation and reconstructs audio from those tokens. It is aimed at machine learning engineers and researchers who build or evaluate speech and audio processing systems.

No commits in the last 6 months.

Use this if you are a developer looking for a standardized way to integrate, experiment with, and benchmark various state-of-the-art audio codecs in your machine learning projects.

Not ideal if you are an end-user simply looking to compress audio files for storage or sharing; this is a developer tool, not a consumer application.

audio-processing speech-technology machine-learning-engineering audio-compression research-prototyping

Stale 6m No Package No Dependents

Maintenance 2 / 25

Adoption 7 / 25

Maturity 16 / 25

Community 8 / 25

Stars

Forks

Language

Python

License

Apache-2.0

Higher-rated alternatives

kan-bayashi/ParallelWaveGAN

Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN & HiFi-GAN & StyleMelGAN) with Pytorch

fatchord/WaveRNN

WaveRNN Vocoder + TTS

shangeth/wavencoder

WavEncoder is a Python library for encoding audio signals, transforms for audio augmentation,...

rishikksh20/iSTFTNet-pytorch

iSTFTNet : Fast and Lightweight Mel-spectrogram Vocoder Incorporating Inverse Short-time Fourier...

seungwonpark/melgan

MelGAN vocoder (compatible with NVIDIA/tacotron2)
