erogol/FFTNet

FFTNet vocoder implementation

37
/ 100
Emerging

This tool takes raw audio recordings, analyzes their sound characteristics, and then converts these characteristics into new, realistic-sounding speech. It's used by researchers and developers working on speech synthesis to efficiently generate human-like voices from audio features.

No commits in the last 6 months.

Use this if you need to transform the acoustic properties of audio, like a spectrogram, into high-fidelity speech waveforms for synthetic voice generation.

Not ideal if you're looking for a tool to transcribe speech to text, translate languages, or perform general audio editing.

speech-synthesis voice-generation text-to-speech audio-processing vocoding
Stale 6m No Package No Dependents
Maintenance 0 / 25
Adoption 9 / 25
Maturity 16 / 25
Community 12 / 25

How are scores calculated?

Stars

81

Forks

8

Language

Jupyter Notebook

License

MPL-2.0

Last pushed

Sep 28, 2018

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/erogol/FFTNet"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.