neosapience/editts
Official implementation of EdiTTS: Score-based Editing for Controllable Text-to-Speech (INTERSPEECH 2022)
This tool lets you edit synthesized speech precisely, without re-synthesizing the whole utterance from scratch. You provide text with marked segments and specify changes to pitch or content, and it outputs natural-sounding edited audio files. It suits content creators, voice-over artists, and anyone who needs to fine-tune synthetic speech.
121 stars. No commits in the last 6 months.
Use this if you need to make specific, localized adjustments to synthesized speech, such as altering a word or changing the pitch of a sentence, without re-generating the whole audio from scratch.
Not ideal if you need to generate entirely new speech from text without any editing requirements, or if you prefer a graphical user interface for audio manipulation.
Stars: 121
Forks: 17
Language: Python
License: —
Category:
Last pushed: Jan 24, 2023
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/neosapience/editts"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
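The same request can be sketched in Python using only the standard library. This is a minimal sketch, not an official client: the helper names `build_quality_url` and `fetch_quality` are hypothetical, reading the URL path as category/owner/repository is an assumption based on the curl example, and the response is assumed (not confirmed by this page) to be JSON.

```python
import json
import urllib.request
from urllib.parse import quote

# Base path taken from the curl example above.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def build_quality_url(category: str, owner: str, repo: str) -> str:
    """Build the endpoint URL; the segment meaning is an assumption."""
    parts = [quote(part, safe="") for part in (category, owner, repo)]
    return BASE_URL + "/" + "/".join(parts)


def fetch_quality(category: str, owner: str, repo: str, timeout: float = 10.0):
    """Fetch the record without an API key.

    Assumes the body is JSON; json.loads raises ValueError otherwise.
    """
    url = build_quality_url(category, owner, repo)
    with urllib.request.urlopen(url, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))


if __name__ == "__main__":
    # Performs a real network call and counts against the daily quota.
    print(fetch_quality("voice-ai", "neosapience", "editts"))
```

Because keyless access is limited to 100 requests per day, it may be worth caching the result locally rather than calling `fetch_quality` repeatedly.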
Higher-rated alternatives
TensorSpeech/TensorFlowTTS
😝 TensorFlowTTS: Real-Time State-of-the-art Speech Synthesis for...
lucasnewman/nanospeech
A simple, hackable text-to-speech system in PyTorch and MLX
Tomiinek/Multilingual_Text_to_Speech
An implementation of Tacotron 2 that supports multilingual experiments with parameter-sharing,...
keonlee9420/STYLER
Official repository of STYLER: Style Factor Modeling with Rapidity and Robustness via Speech...
jxzhanggg/nonparaSeq2seqVC_code
Implementation code of non-parallel sequence-to-sequence VC