sp-nitech/diffsptk
A differentiable version of SPTK
This project offers essential speech signal processing tools, like those for analyzing pitch, converting text to speech, and handling audio, but with a unique twist: they're designed to integrate directly into machine learning models. It takes raw audio waveforms or speech features as input and outputs processed speech features or reconstructed audio. This is for researchers and developers building advanced speech AI systems.
196 stars. Available on PyPI.
Use this if you are developing neural network models for speech applications and need to incorporate traditional speech processing algorithms in a way that allows for end-to-end optimization.
Not ideal if you are a practitioner looking for a standalone application for basic audio editing or standard speech analysis without integrating into a deep learning pipeline.
Stars
196
Forks
20
Language
Python
License
Apache-2.0
Category
Last pushed
Feb 26, 2026
Commits (30d)
0
Dependencies
11
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/ml-frameworks/sp-nitech/diffsptk"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Related frameworks
trigeorgis/mdm
A TensorFlow implementation of the Mnemonic Descent Method.
clovaai/mxfont
Official PyTorch implementation of MX-Font (Multiple Heads are Better than One: Few-shot Font...
clovaai/fewshot-font-generation
The unified repository for few-shot font generation methods. This repository includes FUNIT...
Michedev/DDPMs-Pytorch
Implementation of various DDPM papers to understand how they work
openclimatefix/diffusion_weather
Testing out Diffusion-based models for weather and PV forecasting