Fantasy-AMAP/fantasy-talking2

[AAAI 2026] FantasyTalking2: Timestep-Layer Adaptive Preference Optimization for Audio-Driven Portrait Animation

/ 100

Experimental

This project helps animators and content creators generate highly realistic talking portraits from audio. It takes an audio input and an image of a person, and outputs a video of that person speaking the audio with natural lip movements, expressions, and overall visual quality. Anyone creating animated characters for media, marketing, or educational content would find this useful.

No commits in the last 6 months.

Use this if you need to animate a static portrait or image to realistically speak an audio track, prioritizing natural motion, accurate lip-sync, and high visual quality.

Not ideal if you're looking to generate full-body animations or modify the background environment of the video.

portrait-animation video-generation content-creation digital-media virtual-assistants

No License Stale 6m No Package No Dependents

Maintenance 2 / 25

Adoption 8 / 25

Maturity 7 / 25

Community 6 / 25

How are scores calculated?

Stars

Forks

Language

—

License

—

Higher-rated alternatives

hao-ai-lab/FastVideo

A unified inference and post-training framework for accelerated video generation.

ModelTC/LightX2V

Light Image Video Generation Inference Framework

thu-ml/TurboDiffusion

TurboDiffusion: 100–200× Acceleration for Video Diffusion Models

PKU-YuanGroup/Helios

Helios: Real Real-Time Long Video Generation Model

PKU-YuanGroup/MagicTime

[TPAMI 2025🔥] MagicTime: Time-lapse Video Generation Models as Metamorphic Simulators

Explore Diffusion Models

All categories Trending Diffusion directory Insights