sp-nitech/diffsptk

A differentiable version of SPTK

/ 100

Established

This project offers essential speech signal processing tools, like those for analyzing pitch, converting text to speech, and handling audio, but with a unique twist: they're designed to integrate directly into machine learning models. It takes raw audio waveforms or speech features as input and outputs processed speech features or reconstructed audio. This is for researchers and developers building advanced speech AI systems.

196 stars. Available on PyPI.

Use this if you are developing neural network models for speech applications and need to incorporate traditional speech processing algorithms in a way that allows for end-to-end optimization.

Not ideal if you are a practitioner looking for a standalone application for basic audio editing or standard speech analysis without integrating into a deep learning pipeline.

speech-recognition speech-synthesis audio-analysis voice-processing machine-learning-research

Maintenance 10 / 25

Adoption 10 / 25

Maturity 25 / 25

Community 14 / 25

How are scores calculated?

Stars

196

Forks

Language

Python

License

Apache-2.0

Related frameworks

trigeorgis/mdm

A TensorFlow implementation of the Mnemonic Descent Method.

clovaai/mxfont

Official PyTorch implementation of MX-Font (Multiple Heads are Better than One: Few-shot Font...

clovaai/fewshot-font-generation

The unified repository for few-shot font generation methods. This repository includes FUNIT...

Michedev/DDPMs-Pytorch

Implementation of various DDPM papers to understand how they work

openclimatefix/diffusion_weather

Testing out Diffusion-based models for weather and PV forecasting

Explore ML Frameworks

All categories Trending ML Framework directory Insights