manhph2211/ViTTS

In this repo, I developed a step-by-step pipeline for a standard MultiSpeaker Text-to-Speech system :smile: In general, I used Portaspeech as an acoustic model and iSTFTNet as vocoder...

13
/ 100
Experimental

This project helps audio engineers and content creators quickly convert written text into high-quality, natural-sounding speech from multiple distinct voices. You provide text and audio samples for each desired speaker, and it generates audio files of that text spoken in their voice. This is ideal for those needing to produce diverse voice content efficiently.

No commits in the last 6 months.

Use this if you need to generate personalized, multi-speaker voiceovers or audio content from text using existing voice samples.

Not ideal if you're looking for a pre-trained, ready-to-use text-to-speech service without any setup or custom model training.

voice-synthesis audio-content-creation narration digital-voice-cloning speech-generation
No License Stale 6m No Package No Dependents
Maintenance 0 / 25
Adoption 5 / 25
Maturity 8 / 25
Community 0 / 25

How are scores calculated?

Stars

12

Forks

Language

Python

License

Last pushed

Nov 24, 2023

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/manhph2211/ViTTS"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.