soCzech/GenHowTo
Code for the paper "GenHowTo: Learning to Generate Actions and State Transformations from Instructional Videos" published at CVPR 2024
This project helps computer vision researchers and AI practitioners generate images showing an object's future state or the action that transforms it. You provide an initial image and a text description, and it outputs a new image depicting the visual change or action. This is for those working on tasks like robotic task learning or instructional video analysis.
No commits in the last 6 months.
Use this if you need to visualize potential future states or the actions involved in transforming objects, based on an input image and a descriptive prompt.
Not ideal if you are looking for a general-purpose image generation tool or a system that plans sequences of actions.
Stars
53
Forks
4
Language
Python
License
MIT
Category
Last pushed
Mar 03, 2024
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/generative-ai/soCzech/GenHowTo"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
ZiYang-xie/WorldGen
🌍 WorldGen - Generate Any 3D Scene in Seconds
aioz-ai/AIOZ-GDANCE
AIOZ-GDANCE: a large-scale dataset & baseline for music-driven group dance generation. (CVPR 2023)
worldbench/WorldLens
[CVPR 2026] WorldLens: Full-Spectrum Evaluations of Driving World Models in Real World
Kobaayyy/Awesome-CVPR2026-CVPR2025-ICCV2025-CVPR2024-ECCV2024-AIGC
A Collection of Papers and Codes for CVPR2026/CVPR2025/ICCV2025/CVPR2024/ECCV2024 AIGC
nv-tlabs/XCube
[CVPR 2024 Highlight] XCube: Large-Scale 3D Generative Modeling using Sparse Voxel Hierarchies