Chris10M/Lip2Speech

A pipeline to read lips and generate speech for the read content, i.e Lip to Speech Synthesis.

46
/ 100
Emerging

This tool helps convert silent video footage of a person speaking into audible speech. It takes a video of someone's face as they form words, analyzes their lip movements, and then generates an audio file of what they are saying. It's designed for researchers or practitioners working with video analysis, accessibility, or voice generation.

No commits in the last 6 months.

Use this if you need to extract spoken words from video where audio is unavailable or unclear, by analyzing the speaker's lip movements.

Not ideal if you already have clear audio or if the speaker's lips are not clearly visible in the video.

speech-synthesis video-analysis lip-reading accessibility-tech forensic-video
Stale 6m No Package No Dependents
Maintenance 2 / 25
Adoption 9 / 25
Maturity 16 / 25
Community 19 / 25

How are scores calculated?

Stars

93

Forks

22

Language

Python

License

MIT

Last pushed

Jul 23, 2025

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/Chris10M/Lip2Speech"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.