yl4579/PitchExtractor

Deep Neural Pitch Extractor for Voice Conversion and TTS Training

47
/ 100
Emerging

This project helps developers train deep neural networks for advanced voice manipulation, specifically for tasks like voice conversion or generating speech from text. It takes audio recordings and processes them to extract fundamental frequency (pitch) information, which is crucial for creating natural-sounding synthetic voices. This tool is designed for machine learning engineers and researchers working on speech synthesis and voice conversion technologies.

147 stars. No commits in the last 6 months.

Use this if you are a machine learning engineer building a voice conversion system or a text-to-speech model and need to accurately extract pitch contours from audio data to train your neural network.

Not ideal if you are an end-user looking for a ready-to-use application to convert voices or generate speech; this is a component for building such systems.

voice-conversion speech-synthesis deep-learning audio-processing machine-learning-engineering
Stale 6m No Package No Dependents
Maintenance 0 / 25
Adoption 10 / 25
Maturity 16 / 25
Community 21 / 25

How are scores calculated?

Stars

147

Forks

34

Language

Python

License

MIT

Last pushed

Aug 22, 2022

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/yl4579/PitchExtractor"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.