chienhsiang-hung/voice-and-wav-cloning

通過少量語音與影片樣本生成高質量的語音與影片克隆 ( AI 人像口白生成 )，並提供多種音頻處理技術來提升音質和真實感。

/ 100

Emerging

This project helps content creators, marketers, or educators generate high-quality voiceovers and create realistic talking head videos from just a small amount of voice and video samples. You provide reference audio/video and text, and it outputs synthesized speech and lip-synced videos. It's designed for anyone needing to produce engaging video content efficiently without needing professional studios or actors.

No commits in the last 6 months.

Use this if you need to generate a realistic voiceover from text and synchronize it with an existing video or create a talking head animation from a single image and audio.

Not ideal if you require real-time voice synthesis or live video manipulation, as this tool focuses on offline content generation.

video-production content-creation voiceover animation digital-marketing

Stale 6m No Package No Dependents

Maintenance 0 / 25

Adoption 5 / 25

Maturity 16 / 25

Community 14 / 25

How are scores calculated?

Stars

Forks

Language

Jupyter Notebook

License

MIT

Higher-rated alternatives

pnnbao97/VieNeu-TTS

Vietnamese TTS with instant voice cloning • On-device • Real-time CPU inference • 24kHz audio...

CorentinJ/Real-Time-Voice-Cloning

Clone a voice in 5 seconds to generate arbitrary speech in real-time

babysor/MockingBird

🚀Clone a voice in 5 seconds to generate arbitrary speech in real-time

r9y9/nnmnkwii

Library to build speech synthesis systems designed for easy and fast prototyping.

Softcatala/open-dubbing

Open dubbing is an AI dubbing system which uses machine learning models to automatically...

Explore Voice AI Tools

All categories Trending Voice AI directory Insights