xjywhu/Awesome-Multimodal-LLM-for-Code

Multimodal Large Language Models for Code Generation under Multimodal Scenarios

Score: 40 / 100 (Emerging)

This is a curated collection of research papers and benchmarks on generating code automatically from visual inputs. It covers methods for converting UI designs, such as screenshots or mockups, into front-end code for web and mobile apps, and for turning scientific plots and charts back into the code that produced them. Software developers, UI/UX designers, and researchers in AI and software engineering will find it useful for understanding the state of the art in multimodal code generation.

221 stars.

Use this if you are a developer, designer, or researcher who wants to generate code (UI, scientific plots, etc.) directly from visual or multimodal inputs and to explore the latest academic work and benchmarks in this field.

Not ideal if you are looking for a ready-to-use software tool or library to implement multimodal code generation directly, as this repository primarily links to academic papers.

AI-driven development, front-end development, UI/UX design, scientific visualization, code generation
No License, No Package, No Dependents
Maintenance 13 / 25
Adoption 10 / 25
Maturity 8 / 25
Community 9 / 25
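
The four sub-scores above add up to the overall 40 / 100 figure. The page does not state the scoring formula, so the sketch below only checks that arithmetic, under the assumption that the total is a plain sum of four categories capped at 25 each. The variable names are illustrative.

```python
# Sub-scores as listed on this page; each category is out of 25.
sub_scores = {
    "Maintenance": 13,
    "Adoption": 10,
    "Maturity": 8,
    "Community": 9,
}
MAX_PER_CATEGORY = 25

# Assumption: the overall score is the unweighted sum of the categories.
total = sum(sub_scores.values())
max_total = MAX_PER_CATEGORY * len(sub_scores)

print(f"{total} / {max_total}")  # prints "40 / 100"
```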


Stars: 221
Forks: 9
Language: (not listed)
License: none
Last pushed: Mar 16, 2026
Commits (30d): 0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/transformers/xjywhu/Awesome-Multimodal-LLM-for-Code"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
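
The same request as the curl command above can be made from Python using only the standard library. This is a minimal sketch, not an official client. The function names and the `category` parameter are hypothetical; the path segment `transformers` is copied from the URL above without knowing what it denotes. The response is assumed to be JSON, which this page does not confirm. How an API key is supplied is also not documented here, so the sketch makes unauthenticated requests only.

```python
import json
import urllib.request
from urllib.parse import quote

# Base path taken from the curl example on this page.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def quality_url(category: str, owner: str, repo: str) -> str:
    """Build the endpoint URL; each path segment is percent-encoded."""
    parts = [quote(part, safe="") for part in (category, owner, repo)]
    return f"{BASE_URL}/{'/'.join(parts)}"


def fetch_quality(category: str, owner: str, repo: str, timeout: float = 10.0):
    """Fetch the quality record for a repository.

    Assumption: the endpoint returns a JSON body. If it does not,
    json.loads raises a ValueError (json.JSONDecodeError).
    """
    request = urllib.request.Request(
        quality_url(category, owner, repo),
        headers={"Accept": "application/json"},
    )
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))
```

Calling `fetch_quality("transformers", "xjywhu", "Awesome-Multimodal-LLM-for-Code")` issues the request shown in the curl line. With the keyless limit of 100 requests/day, it is worth caching the result rather than polling.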