abdur75648/V-Zen

V-Zen: Efficient GUI Understanding and Precise Grounding With A Novel Multimodal LLM Resources

36
/ 100
Emerging

This project helps with understanding and automating interactions with graphical user interfaces (GUIs). It takes visual information from a GUI and translates it into actionable commands, allowing for precise control and navigation. This is designed for engineers or developers building automated workflows for applications and software.

No commits in the last 6 months.

Use this if you need to build advanced automation tools that can accurately interpret and interact with complex graphical interfaces.

Not ideal if you are looking for an off-the-shelf, no-code automation solution for simple tasks.

GUI-automation software-testing robotics-process-automation developer-tooling application-control
Stale 6m No Package No Dependents
Maintenance 0 / 25
Adoption 5 / 25
Maturity 16 / 25
Community 15 / 25

How are scores calculated?

Stars

9

Forks

4

Language

License

Apache-2.0

Last pushed

Jul 21, 2024

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/transformers/abdur75648/V-Zen"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.