OpenAdaptAI/OmniMCP
OmniMCP uses Microsoft OmniParser and Model Context Protocol (MCP) to provide AI models with rich UI context and powerful interaction capabilities.
This project helps developers automate interactions with graphical user interfaces (GUIs) on their computer. It takes a high-level goal, like "open calculator and compute 5*9," analyzes the screen, plans the necessary mouse clicks and keyboard inputs, and then executes them. A developer would use this to programmatically control applications as if a human were interacting with them.
No commits in the last 6 months.
Use this if you are a developer looking to automate complex GUI workflows or test applications through visual interaction rather than API calls.
Not ideal if you need a simple script for keyboard shortcuts or already have an application with a robust API for direct control.
Stars
71
Forks
16
Language
Python
License
—
Category
Last pushed
Apr 08, 2025
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/mcp/OpenAdaptAI/OmniMCP"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
jonigl/mcp-client-for-ollama
A text-based user interface (TUI) client for interacting with MCP servers using Ollama. Features...
ArcadeAI/arcade-mcp
The best way to create, deploy, and share MCP Servers
Dicklesworthstone/ultimate_mcp_server
Comprehensive MCP server exposing dozens of capabilities to AI agents: multi-provider LLM...
hmldns/nautex
MCP server for guiding Coding Agents via end-to-end requirements to implementation plan pipeline
SecretiveShell/MCP-Bridge
A middleware to provide an openAI compatible endpoint that can call MCP tools