Fzkuji/GUI-Agent-Harness

🦞 Vision-based desktop automation skills for OpenClaw agents on macOS. See, learn, click — any app.

Score: 34 / 100 (Emerging)

This tool helps macOS users automate repetitive desktop tasks by letting an AI agent "see" and interact with any application on the screen, just like a human. You provide a high-level instruction, and the system processes visual input from your screen to perform clicks, typing, and navigation across different apps. It's designed for anyone who needs to automate multi-step workflows that involve various desktop applications, without needing to write complex scripts or code.

Use this if you need to automate workflows across multiple macOS applications, such as managing emails, updating spreadsheets, or navigating web interfaces, with natural language commands.

Not ideal if your automation needs are limited to a single application with a well-defined API, or if you require cross-platform support beyond macOS.

desktop-automation workflow-automation macOS-productivity digital-assistant task-management
No package, no dependents
Maintenance 13 / 25
Adoption 6 / 25
Maturity 11 / 25
Community 4 / 25
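
The page does not state how the headline score is derived. As a quick arithmetic check, assuming it is simply the sum of the four category scores listed above (an assumption, although the numbers agree), the total can be reproduced like this:

```python
# Category scores copied from the listing above; each is out of 25.
category_scores = {
    "Maintenance": 13,
    "Adoption": 6,
    "Maturity": 11,
    "Community": 4,
}
max_per_category = 25

# Assumption: the headline score is a plain sum of the categories.
total = sum(category_scores.values())
max_total = max_per_category * len(category_scores)

print(f"{total} / {max_total}")
```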

Stars: 21
Forks: 1
Language: Python
License: MIT
Category: browser-agent
Last pushed: Apr 04, 2026
Commits (30d): 0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/agents/Fzkuji/GUI-Agent-Harness"

Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000 requests/day.
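
For scripted use, the same request can be made from Python with only the standard library. This is a minimal sketch, not an official client: the helper names `quality_url` and `fetch_quality` are hypothetical, and it assumes the endpoint returns a JSON body, which the page does not document. No response field names are assumed.

```python
import json
import urllib.request
from urllib.parse import quote

# Base path taken from the curl example above.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality/agents"


def quality_url(owner: str, repo: str) -> str:
    """Build the endpoint URL for one repository (hypothetical helper)."""
    return f"{BASE_URL}/{quote(owner, safe='')}/{quote(repo, safe='')}"


def fetch_quality(owner: str, repo: str, timeout: float = 10.0):
    """Send a keyless GET request and decode the body.

    Assumption: the response is JSON. Its structure is not documented
    on the page, so the decoded value is returned unchanged.
    """
    with urllib.request.urlopen(quality_url(owner, repo), timeout=timeout) as resp:
        return json.loads(resp.read().decode("utf-8"))
```

Calling `fetch_quality("Fzkuji", "GUI-Agent-Harness")` sends the same request as the curl command. Without a key, each call counts against the 100 requests/day limit, so cache results when polling many repositories.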