M-SRIKAR-VARDHAN/speech-to-speech-with-lipsync

End-to-end speech-to-speech translation pipeline with voice cloning (RVC) and automatic lip-sync (Wav2Lip).

30
/ 100
Emerging

This tool helps content creators, educators, or media producers translate spoken content in videos from one language to another, while preserving the original speaker's voice and synchronizing their lip movements. You provide an existing video with speech in a source language (e.g., English), and it outputs a new video where the speech is translated into a target language (e.g., Telugu), with the speaker's original voice and realistic lip-sync.

Use this if you need to create dubbed versions of videos for a global audience, ensuring the speaker sounds natural and their lip movements match the new audio.

Not ideal if you only need text-to-text translation or simple audio dubbing without voice cloning or lip-sync, as the pipeline is more complex than necessary for those tasks.

video-localization content-dubbing voice-cloning media-production educational-content
No License No Package No Dependents
Maintenance 6 / 25
Adoption 6 / 25
Maturity 7 / 25
Community 11 / 25

How are scores calculated?

Stars

23

Forks

3

Language

Python

License

Last pushed

Nov 08, 2025

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/M-SRIKAR-VARDHAN/speech-to-speech-with-lipsync"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.