M-SRIKAR-VARDHAN/speech-to-speech-with-lipsync
End-to-end speech-to-speech translation pipeline with voice cloning (RVC) and automatic lip-sync (Wav2Lip).
This tool helps content creators, educators, and media producers translate the spoken content of a video from one language to another while preserving the original speaker's voice and synchronizing their lip movements. You provide an existing video with speech in a source language (e.g., English), and it outputs a new video in which the speech is translated into a target language (e.g., Telugu), delivered in the speaker's original voice with realistic lip-sync.
Use this if you need to create dubbed versions of videos for a global audience, ensuring the speaker sounds natural and their lip movements match the new audio.
Not ideal if you only need text-to-text translation or simple audio dubbing without voice cloning or lip-sync, as the pipeline is more complex than necessary for those tasks.
Stars: 23
Forks: 3
Language: Python
License: not listed
Category:
Last pushed: Nov 08, 2025
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/M-SRIKAR-VARDHAN/speech-to-speech-with-lipsync"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
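The same request can be made from Python. The sketch below is a minimal example using only the standard library; it assumes the endpoint returns JSON and that the path follows a category/owner/repository layout, both of which are inferred from the single curl URL above rather than from documented behavior. The helper names `build_quality_url` and `fetch_quality` are hypothetical.

```python
import json
import urllib.request
from urllib.parse import quote

# Base path taken from the curl example above.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def build_quality_url(category: str, owner: str, repo: str) -> str:
    """Build the request URL (hypothetical helper).

    Assumes the path layout is <category>/<owner>/<repo>, as suggested
    by the one URL shown in the listing.
    """
    parts = [quote(part, safe="") for part in (category, owner, repo)]
    return BASE_URL + "/" + "/".join(parts)


def fetch_quality(category: str, owner: str, repo: str, timeout: float = 10.0):
    """Fetch the listing data (hypothetical helper).

    Assumes the response body is JSON; the response schema is not
    documented in the listing, so the parsed value is returned as-is.
    """
    url = build_quality_url(category, owner, repo)
    with urllib.request.urlopen(url, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))
```

Calling `fetch_quality("voice-ai", "M-SRIKAR-VARDHAN", "speech-to-speech-with-lipsync")` issues the same GET request as the curl command. No key is sent, so the call counts against the keyless daily limit.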
Higher-rated alternatives
primepake/wav2lip_288x288
Wav2Lip version 288 and pipeline to train
SARIT42/lipsyncr
LipSyncr is a lip reading web app based on the LipNet model that can lip read videos.
Chris10M/Lip2Speech
A pipeline to read lips and generate speech for the read content, i.e., lip-to-speech synthesis.
Markfryazino/wav2lip-hq
Extension of Wav2Lip repository for processing high-quality videos.
d-kavinraja/MouthMap
MouthMap is a deep learning-based lip reading system that converts silent video sequences into...