EagleW/Multimedia-Generative-Script-Learning

Official implementation of the ACL Findings 2023 paper: Multimedia Generative Script Learning for Task Planning

20
/ 100
Experimental

This project helps generate the next logical steps for a given task, based on both text descriptions and images of previous steps. You input a task's title, previous method, a list of step texts, corresponding image captions, and the last step's image, and it outputs the predicted next step text and image. It's designed for developers building automated task planning or instructional systems, particularly for crafts and gardening.

No commits in the last 6 months.

Use this if you are a machine learning researcher or developer working on AI models that need to predict sequential, multi-modal steps for tasks like crafts or gardening.

Not ideal if you are a casual user looking for a ready-to-use application to generate instructions, or if your tasks fall outside of detailed instructional sequences for crafts or gardening.

task-planning generative-AI crafts-instruction gardening-guides multimedia-sequencing
Stale 6m No Package No Dependents
Maintenance 0 / 25
Adoption 4 / 25
Maturity 16 / 25
Community 0 / 25

How are scores calculated?

Stars

8

Forks

Language

Python

License

MIT

Last pushed

Mar 18, 2025

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/rag/EagleW/Multimedia-Generative-Script-Learning"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.