choyingw/Cross-Modal-Perceptionist

CVPR 2022: Cross-Modal Perceptionist: Can Face Geometry be Gleaned from Voices?

40
/ 100
Emerging

This project explores the intriguing connection between a person's voice and the unique geometry of their face. It takes an audio recording of a person speaking and predicts a 3D model of their face, including details about its shape and structure. This could be used by researchers in fields like forensics, biometrics, or computational psychology to analyze underlying physiological correlations.

131 stars. No commits in the last 6 months.

Use this if you are a researcher studying the complex relationships between vocal characteristics and facial biometrics, particularly for understanding genetic or physiological links.

Not ideal if you need to identify individuals from their voice, as this project focuses on predicting general facial geometry rather than specific person identification.

biometrics forensics computational-psychology voice-analysis facial-modeling
Stale 6m No Package No Dependents
Maintenance 0 / 25
Adoption 10 / 25
Maturity 16 / 25
Community 14 / 25

How are scores calculated?

Stars

131

Forks

15

Language

Python

License

MIT

Last pushed

Dec 11, 2024

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/computer-vision/choyingw/Cross-Modal-Perceptionist"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.