HAKORADev/VODER
Voice Operation and Design Engine with Reproduction capabilities
This tool helps content creators, audio professionals, and marketers convert between speech, text, and music effortlessly. You can input audio, video, images, or even YouTube links, and it generates high-quality synthesized speech, cloned voices, transcribed text, or background music. Anyone producing podcasts, audiobooks, news broadcasts, or marketing content will find this useful.
116 stars.
Use this if you need to quickly generate spoken audio from text, clone voices, transcribe various media into text, or create multi-speaker dialogue with optional background music.
Not ideal if you primarily need advanced music composition or intricate sound design features beyond basic background music generation.
Stars
116
Forks
9
Language
Python
License
MIT
Category
Last pushed
Mar 18, 2026
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/HAKORADev/VODER"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
kan-bayashi/ParallelWaveGAN
Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN & HiFi-GAN & StyleMelGAN) with Pytorch
fatchord/WaveRNN
WaveRNN Vocoder + TTS
shangeth/wavencoder
WavEncoder is a Python library for encoding audio signals, transforms for audio augmentation,...
rishikksh20/iSTFTNet-pytorch
iSTFTNet : Fast and Lightweight Mel-spectrogram Vocoder Incorporating Inverse Short-time Fourier...
seungwonpark/melgan
MelGAN vocoder (compatible with NVIDIA/tacotron2)