Fantasy-AMAP/fantasy-talking
[ACM MM 2025] FantasyTalking: Realistic Talking Portrait Generation via Coherent Motion Synthesis
This project helps create realistic talking videos from a still image and an audio file. You provide a photo of a person and an audio recording of what you want them to say, and it generates a video where the person in the photo speaks the words from the audio with natural lip movements and facial expressions. This is ideal for content creators, marketers, or educators looking to animate static images for presentations or social media.
Use this if you need to quickly generate a video of a person speaking from just an image and an audio file, with the option to guide their non-verbal communication.
Not ideal if you need to generate full-body animated characters or require highly complex, custom animation beyond talking head videos.
Stars: 1,622
Forks: 126
Language: Python
License: Apache-2.0
Category:
Last pushed: Jan 26, 2026
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/diffusion/Fantasy-AMAP/fantasy-talking"
Open to everyone: 100 requests/day with no key required. A free key raises the limit to 1,000/day.
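The same endpoint can be queried from Python with only the standard library. A minimal sketch: the response schema is not documented on this page, so the payload is returned as a raw decoded dict, and the `build_url`/`fetch_quality` helper names are illustrative, not part of the API.

```python
import json
import urllib.request

# Base path taken from the curl example above.
API_BASE = "https://pt-edge.onrender.com/api/v1/quality/diffusion"


def build_url(owner: str, repo: str) -> str:
    """Construct the per-repository endpoint URL used in the curl example."""
    return f"{API_BASE}/{owner}/{repo}"


def fetch_quality(owner: str, repo: str) -> dict:
    """Fetch and decode the JSON payload for one repository.

    The response schema isn't documented here, so the decoded dict is
    returned as-is for the caller to inspect.
    """
    with urllib.request.urlopen(build_url(owner, repo), timeout=10) as resp:
        return json.load(resp)


if __name__ == "__main__":
    print(build_url("Fantasy-AMAP", "fantasy-talking"))
```

Unauthenticated calls count against the 100 requests/day limit; how a key is attached (header vs. query parameter) isn't stated on this page, so it is omitted here.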
Related models
hao-ai-lab/FastVideo
A unified inference and post-training framework for accelerated video generation.
ModelTC/LightX2V
Light Image Video Generation Inference Framework
thu-ml/TurboDiffusion
TurboDiffusion: 100–200× Acceleration for Video Diffusion Models
PKU-YuanGroup/Helios
Helios: Real-Time Long Video Generation Model
PKU-YuanGroup/MagicTime
[TPAMI 2025🔥] MagicTime: Time-lapse Video Generation Models as Metamorphic Simulators