lahfir/agent-desktop

Native desktop automation CLI for AI agents. Control any application through OS accessibility trees with structured JSON output and deterministic element refs.

29
/ 100
Experimental

This tool helps automate tasks across any native desktop application on macOS, allowing you to interact with software like Safari, Slack, or System Settings programmatically. It takes commands to observe, click, type, or manage windows, and outputs structured JSON responses, enabling you to build AI agents that can control your computer. Developers and AI engineers can use this to create automated workflows and intelligent assistants that interact with desktop applications.

Use this if you need to build AI agents or automated scripts that interact directly with macOS desktop applications through their accessibility features, rather than just web browsers.

Not ideal if you are looking for a visual, low-code automation builder or if you primarily need to automate tasks within web browsers or specific, well-supported applications with existing APIs.

desktop-automation AI-agent-development workflow-automation scripting macOS-automation
No License No Package No Dependents
Maintenance 10 / 25
Adoption 7 / 25
Maturity 3 / 25
Community 9 / 25

How are scores calculated?

Stars

32

Forks

3

Language

Rust

License

Last pushed

Mar 11, 2026

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/agents/lahfir/agent-desktop"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.