soCzech/GenHowTo

Code for the paper "GenHowTo: Learning to Generate Actions and State Transformations from Instructional Videos" published at CVPR 2024

33
/ 100
Emerging

This project helps computer vision researchers and AI practitioners generate images showing an object's future state or the action that transforms it. You provide an initial image and a text description, and it outputs a new image depicting the visual change or action. This is for those working on tasks like robotic task learning or instructional video analysis.

No commits in the last 6 months.

Use this if you need to visualize potential future states or the actions involved in transforming objects, based on an input image and a descriptive prompt.

Not ideal if you are looking for a general-purpose image generation tool or a system that plans sequences of actions.

computer-vision-research AI-image-generation robotic-task-learning instructional-video-analysis visual-state-prediction
Stale 6m No Package No Dependents
Maintenance 0 / 25
Adoption 8 / 25
Maturity 16 / 25
Community 9 / 25

How are scores calculated?

Stars

53

Forks

4

Language

Python

License

MIT

Last pushed

Mar 03, 2024

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/generative-ai/soCzech/GenHowTo"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.