rishikksh20/Zero-Shot-TTS
Unofficial Implementation of Zero-Shot Text-to-Speech for Text-Based Insertion in Audio Narration
This tool helps content creators and narrators seamlessly insert new sentences or phrases into existing audio narration using only text. You provide the original audio file and the text you want to add, and it generates the new audio segment in a voice that matches the original. It's ideal for anyone who needs to make small edits or additions to spoken content without re-recording the entire piece.
No commits in the last 6 months.
Use this if you need to insert new speech, specified as text, into an existing audio narration and want it to sound consistent with the original speaker's voice.
Not ideal if you're looking for a complete text-to-speech or speech synthesis solution with training capabilities, as this focuses specifically on zero-shot insertion.
Stars: 34
Forks: 3
Language: Python
License: MIT
Category:
Last pushed: Sep 24, 2021
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/rishikksh20/Zero-Shot-TTS"
Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000 requests/day.
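The same request can be made from Python. The sketch below is a minimal example using only the standard library, not an official client. The URL pattern is copied from the curl command above. Reading the path segment "voice-ai" as a category, the helper names `quality_url` and `fetch_quality`, and the assumption that the response body is JSON are all illustrative guesses that this page does not confirm.

```python
import json
import urllib.request
from urllib.parse import quote

# Base path taken from the curl example; everything after it is per-repository.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def quality_url(category: str, owner: str, repo: str) -> str:
    """Build the endpoint URL in the same shape as the curl example.

    Treating the first segment as a "category" is an assumption.
    """
    parts = [quote(part, safe="") for part in (category, owner, repo)]
    return BASE_URL + "/" + "/".join(parts)


def fetch_quality(category: str, owner: str, repo: str, timeout: float = 10.0):
    """Fetch the data for one repository.

    Assumes the server answers with a JSON body; the response schema
    is not documented on this page, so the parsed value is returned as-is.
    """
    url = quality_url(category, owner, repo)
    with urllib.request.urlopen(url, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))


if __name__ == "__main__":
    # Prints the URL only; call fetch_quality(...) to perform the request.
    print(quality_url("voice-ai", "rishikksh20", "Zero-Shot-TTS"))
```

Calling `fetch_quality("voice-ai", "rishikksh20", "Zero-Shot-TTS")` performs the network request and counts against the daily limit described above.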
Higher-rated alternatives
index-tts/index-tts
An Industrial-Level Controllable and Efficient Zero-Shot Text-To-Speech System
stepfun-ai/Step-Audio-EditX
A powerful 3B-parameter, LLM-based Reinforcement Learning audio edit model excels at editing...
lucasnewman/f5-tts-mlx
Implementation of F5-TTS in MLX
unilight/seq2seq-vc
A sequence-to-sequence voice conversion toolkit.
FireRedTeam/FireRedTTS
An Open-Sourced LLM-empowered Foundation TTS System