jdh-algo/JoyHallo
JoyHallo: Digital human model for Mandarin
This tool helps marketing professionals, educators, and content creators generate realistic digital human videos for Mandarin speech. You provide a still image of a person and an audio recording of Mandarin speech, and it outputs a video of that person speaking the words with natural lip movements and expressions. It's designed for anyone needing to create engaging video content with digital presenters.
522 stars. No commits in the last 6 months.
Use this if you need to create video content with a digital spokesperson delivering Mandarin speech, especially for presentations, e-learning, or marketing materials.
Not ideal if you primarily need to generate videos for languages other than Mandarin or English, or if you require real-time, live interaction with the digital human.
Stars
522
Forks
51
Language
Python
License
MIT
Category
Last pushed
Sep 21, 2025
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/generative-ai/jdh-algo/JoyHallo"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
Mrkomiljon/awesome-generative-ai
Multimodal generative AI resources : talking heads, STT, TTS, image & video generation, and more.
NVIDIA/Maya-ACE
Maya-ACE: A Reference Client Implementation for NVIDIA ACE Audio2Face Service
OpenVGLab/OmniLottie
[CVPR 2026🔥] 🧑🎨 OmniLottie, an open-sourced multi-modal instructed vector animation generator...
michaelzhang-ai/Speech2Video
ACCV 2020 "Speech2Video Synthesis with 3D Skeleton Regularization and Expressive Body Poses"
Boese0601/Dyadic-Interaction-Modeling
[ECCV 2024] Dyadic Interaction Modeling for Social Behavior Generation