peterwisu/lip-synthesis
Audio-Visual Lip Synthesis via Intermediate Landmark Representation
This tool helps create realistic lip movements on a static image or video of a face, synchronized to any given audio input. You provide a photo or video of a person's face and an audio track, and it generates a new video where the person's lips move naturally in sync with the speech or sound. This is ideal for content creators, animators, or anyone needing to generate speech-synchronized video.
No commits in the last 6 months.
Use this if you need to animate a face to speak specific words or sounds from an audio track, creating a believable talking head effect.
Not ideal if you need to generate a full avatar or character animation beyond just lip synchronization, or if you require real-time interaction for live applications.
Stars
18
Forks
4
Language
Python
License
—
Category
Last pushed
May 16, 2023
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/generative-ai/peterwisu/lip-synthesis"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
Mrkomiljon/awesome-generative-ai
Multimodal generative AI resources : talking heads, STT, TTS, image & video generation, and more.
NVIDIA/Maya-ACE
Maya-ACE: A Reference Client Implementation for NVIDIA ACE Audio2Face Service
OpenVGLab/OmniLottie
[CVPR 2026🔥] 🧑🎨 OmniLottie, an open-sourced multi-modal instructed vector animation generator...
jdh-algo/JoyHallo
JoyHallo: Digital human model for Mandarin
michaelzhang-ai/Speech2Video
ACCV 2020 "Speech2Video Synthesis with 3D Skeleton Regularization and Expressive Body Poses"