lucadellalib/audiocodecs
A collection of audio codecs with a standardized API
This is a collection of audio compression methods (codecs) designed for developers working with audio signals. It lets you encode raw audio into a compressed, tokenized representation and then decode those tokens back into audio. It is aimed at machine learning engineers and researchers who are building or evaluating speech and audio processing systems.
No commits in the last 6 months.
Use this if you are a developer looking for a standardized way to integrate, experiment with, and benchmark various state-of-the-art audio codecs in your machine learning projects.
Not ideal if you are an end-user simply looking to compress audio files for storage or sharing; this is a developer tool, not a consumer application.
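The encode-to-tokens and decode-back-to-audio round trip described above can be illustrated with a toy example. The sketch below does not use the audiocodecs API (which is not shown on this page) and is not a neural codec; it substitutes classic 8-bit mu-law companding as a stand-in, purely to show what "audio in, integer tokens out, audio back" looks like. The function names `encode` and `decode` are hypothetical.

```python
import math

# Toy stand-in for a codec: 8-bit mu-law companding.
# Assumption: this is NOT the audiocodecs API, only an illustration
# of the encode -> tokens -> decode round trip.
MU = 255


def encode(samples):
    """Map float samples in [-1, 1] to integer tokens in [0, 255]."""
    tokens = []
    for x in samples:
        x = max(-1.0, min(1.0, x))  # clip to the valid range
        # Logarithmic compression keeps more resolution near zero.
        y = math.copysign(math.log1p(MU * abs(x)) / math.log1p(MU), x)
        tokens.append(int(round((y + 1.0) / 2.0 * MU)))
    return tokens


def decode(tokens):
    """Map integer tokens in [0, 255] back to float samples in [-1, 1]."""
    samples = []
    for t in tokens:
        y = 2.0 * t / MU - 1.0
        # Invert the logarithmic compression.
        x = math.copysign(math.expm1(abs(y) * math.log1p(MU)) / MU, y)
        samples.append(x)
    return samples


# 10 ms of a 440 Hz sine wave sampled at 16 kHz.
audio = [0.8 * math.sin(2 * math.pi * 440 * n / 16000) for n in range(160)]
tokens = encode(audio)
reconstructed = decode(tokens)
max_error = max(abs(a - b) for a, b in zip(audio, reconstructed))
print(f"{len(tokens)} tokens, max reconstruction error {max_error:.4f}")
```

A learned codec would replace the fixed companding curve with trained encoder and decoder networks and typically emit far fewer tokens per second, but the round-trip shape of the workflow is the same.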
Stars: 36
Forks: 3
Language: Python
License: Apache-2.0
Category:
Last pushed: May 27, 2025
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/lucadellalib/audiocodecs"
Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000 requests/day.
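The same request can be made from Python with only the standard library. This is a minimal sketch under two assumptions that the page does not confirm: that the path segments after `quality/` are a category, an owner, and a repository name, and that the endpoint returns JSON. The helper names `quality_url` and `fetch_quality` are hypothetical.

```python
import json
import urllib.request

# Base of the endpoint shown in the curl command above.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def quality_url(category, owner, repo):
    """Build the request URL (assumed layout: category/owner/repo)."""
    return f"{BASE_URL}/{category}/{owner}/{repo}"


def fetch_quality(category, owner, repo, timeout=10.0):
    """Fetch the record for one repository.

    Assumption: the response body is JSON. If it is not,
    json.loads raises a ValueError.
    """
    url = quality_url(category, owner, repo)
    with urllib.request.urlopen(url, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))
```

Calling `fetch_quality("voice-ai", "lucadellalib", "audiocodecs")` requests the same URL as the curl command. Unauthenticated use counts against the 100 requests/day limit, so results are worth caching when polling many repositories.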
Higher-rated alternatives
kan-bayashi/ParallelWaveGAN
Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN & HiFi-GAN & StyleMelGAN) with Pytorch
fatchord/WaveRNN
WaveRNN Vocoder + TTS
shangeth/wavencoder
WavEncoder is a Python library for encoding audio signals, transforms for audio augmentation,...
rishikksh20/iSTFTNet-pytorch
iSTFTNet : Fast and Lightweight Mel-spectrogram Vocoder Incorporating Inverse Short-time Fourier...
seungwonpark/melgan
MelGAN vocoder (compatible with NVIDIA/tacotron2)