Advocate99/DiffGesture
[CVPR'2023] Taming Diffusion Models for Audio-Driven Co-Speech Gesture Generation
This project helps create realistic co-speech gestures for virtual characters or avatars, making human-machine interactions more natural. It takes audio recordings of speech as input and generates corresponding body movements, specifically skeleton sequences that define the character's gestures. This is useful for animators, content creators, or researchers working with virtual assistants, digital actors, or interactive simulations.
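To make the pipeline concrete, below is a minimal sketch of diffusion-based gesture sampling conditioned on audio features. Everything in it is illustrative: the denoiser stub, the function names, and the shapes (34 frames, 27 pose dimensions) are placeholders standing in for DiffGesture's actual PyTorch model and configuration, not the repo's real API.

import numpy as np

def denoiser(noisy_pose, audio_feat, t):
    # Placeholder for the trained network: a real model predicts the
    # noise component of noisy_pose given the audio conditioning.
    return np.zeros_like(noisy_pose)

def sample_gestures(audio_feat, num_frames=34, pose_dim=27, steps=500, seed=0):
    """Generic DDPM ancestral sampling, conditioned on audio features."""
    rng = np.random.default_rng(seed)
    betas = np.linspace(1e-4, 0.02, steps)      # linear noise schedule
    alphas = 1.0 - betas
    alpha_bar = np.cumprod(alphas)
    x = rng.standard_normal((num_frames, pose_dim))  # start from pure noise
    for t in reversed(range(steps)):
        eps = denoiser(x, audio_feat, t)
        # Posterior mean of x_{t-1} given the predicted noise
        x = (x - betas[t] / np.sqrt(1.0 - alpha_bar[t]) * eps) / np.sqrt(alphas[t])
        if t > 0:
            x += np.sqrt(betas[t]) * rng.standard_normal(x.shape)
    return x  # (num_frames, pose_dim) skeleton sequence

if __name__ == "__main__":
    audio_feat = np.zeros(128)  # placeholder audio embedding
    poses = sample_gestures(audio_feat)
    print(poses.shape)  # (34, 27): frames x flattened joint coordinates

The loop itself is standard DDPM ancestral sampling; the repo's contribution lies in the audio-conditioned denoising network, which the stub above replaces with zeros.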
Use this if you need to animate virtual avatars with natural, synchronized gestures based on spoken audio.
Not ideal if you need to generate gestures from non-speech audio or if you're looking for a simple drag-and-drop animation solution without coding.
Stars: 261
Forks: 19
Language: Python
License: GPL-3.0
Category: Diffusion
Last pushed: Mar 18, 2026
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/diffusion/Advocate99/DiffGesture"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
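If you prefer Python over curl, the same endpoint can be queried with the requests library. No field names are assumed here: the script prints whatever JSON the service returns, so inspect the payload before relying on its schema.

import requests

url = "https://pt-edge.onrender.com/api/v1/quality/diffusion/Advocate99/DiffGesture"
# No key needed within the 100 requests/day free tier.
resp = requests.get(url, timeout=10)
resp.raise_for_status()
print(resp.json())  # stats payload, e.g. stars, forks, last-push date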
Related models
hao-ai-lab/FastVideo
A unified inference and post-training framework for accelerated video generation.
ModelTC/LightX2V
Light Image/Video Generation Inference Framework
thu-ml/TurboDiffusion
TurboDiffusion: 100–200× Acceleration for Video Diffusion Models
PKU-YuanGroup/Helios
Helios: Real-Time Long Video Generation Model
PKU-YuanGroup/MagicTime
[TPAMI 2025🔥] MagicTime: Time-lapse Video Generation Models as Metamorphic Simulators