soCzech/GenHowTo

Code for the paper "GenHowTo: Learning to Generate Actions and State Transformations from Instructional Videos" published at CVPR 2024


Emerging

This project helps computer vision researchers and AI practitioners generate images showing an object's future state or the action that transforms it. You provide an initial image and a text description, and it outputs a new image depicting the described action or the resulting state. It is aimed at those working on tasks such as robotic task learning or instructional video analysis.

No commits in the last 6 months.

Use this if you need to visualize potential future states or the actions involved in transforming objects, based on an input image and a descriptive prompt.

Not ideal if you are looking for a general-purpose image generation tool or a system that plans sequences of actions.

computer-vision-research AI-image-generation robotic-task-learning instructional-video-analysis visual-state-prediction

Stale 6m, No Package, No Dependents

Maintenance 0 / 25

Adoption 8 / 25

Maturity 16 / 25

Community 9 / 25


Language: Python

License: MIT

Higher-rated alternatives

ZiYang-xie/WorldGen

🌍 WorldGen - Generate Any 3D Scene in Seconds

aioz-ai/AIOZ-GDANCE

AIOZ-GDANCE: a large-scale dataset & baseline for music-driven group dance generation. (CVPR 2023)

worldbench/WorldLens

[CVPR 2026] WorldLens: Full-Spectrum Evaluations of Driving World Models in Real World

Kobaayyy/Awesome-CVPR2026-CVPR2025-ICCV2025-CVPR2024-ECCV2024-AIGC

A Collection of Papers and Codes for CVPR2026/CVPR2025/ICCV2025/CVPR2024/ECCV2024 AIGC

nv-tlabs/XCube

[CVPR 2024 Highlight] XCube: Large-Scale 3D Generative Modeling using Sparse Voxel Hierarchies
