chienhsiang-hung/voice-and-wav-cloning
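Generates high-quality voice and video clones (AI talking-head voiceover generation) from a small number of voice and video samples, and provides several audio processing techniques to improve sound quality and realism.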
This project helps content creators, marketers, and educators generate high-quality voiceovers and realistic talking-head videos from a small number of voice and video samples. You provide reference audio/video and text, and it outputs synthesized speech and lip-synced videos. It is designed for anyone who needs to produce engaging video content efficiently without professional studios or actors.
No commits in the last 6 months.
Use this if you need to generate a realistic voiceover from text and synchronize it with an existing video, or to create a talking-head animation from a single image and audio.
Not ideal if you require real-time voice synthesis or live video manipulation, as this tool focuses on offline content generation.
Stars: 9
Forks: 3
Language: Jupyter Notebook
License: MIT
Category:
Last pushed: Nov 04, 2024
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/chienhsiang-hung/voice-and-wav-cloning"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
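For scripted use, the same request can be sketched in Python with only the standard library. This is a minimal sketch, not a documented client: the function names `build_quality_url` and `fetch_quality` are hypothetical, the `category/owner/repo` path layout is inferred from the single curl URL above, and the assumption that the response body is JSON is not confirmed by this page. API-key handling is left out because the page does not say how a key is passed.

```python
import json
import urllib.request
from urllib.parse import quote

# Base path taken from the curl example above.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def build_quality_url(category, owner, repo):
    """Assemble the endpoint URL in the same shape as the curl example.

    Assumption: the path is always <category>/<owner>/<repo>.
    """
    parts = [quote(part, safe="") for part in (category, owner, repo)]
    return BASE_URL + "/" + "/".join(parts)


def fetch_quality(category, owner, repo, timeout=10.0):
    """Fetch the record for one repository.

    Assumption: the endpoint returns a JSON body; the schema is unknown,
    so the parsed object is returned as-is.
    """
    url = build_quality_url(category, owner, repo)
    with urllib.request.urlopen(url, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))


if __name__ == "__main__":
    # Performs a real network call and counts against the daily request limit.
    data = fetch_quality("voice-ai", "chienhsiang-hung", "voice-and-wav-cloning")
    print(json.dumps(data, indent=2))
```

Keeping URL construction separate from the network call lets the URL be checked offline, which helps when staying under the 100 requests/day keyless limit.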
Higher-rated alternatives
pnnbao97/VieNeu-TTS
Vietnamese TTS with instant voice cloning • On-device • Real-time CPU inference • 24kHz audio...
CorentinJ/Real-Time-Voice-Cloning
Clone a voice in 5 seconds to generate arbitrary speech in real-time
babysor/MockingBird
🚀Clone a voice in 5 seconds to generate arbitrary speech in real-time
r9y9/nnmnkwii
Library to build speech synthesis systems designed for easy and fast prototyping.
Softcatala/open-dubbing
Open dubbing is an AI dubbing system which uses machine learning models to automatically...