IDEA-Research/DINO-X-MCP
Official DINO-X Model Context Protocol (MCP) server that empowers LLMs with real-world visual perception through image object detection, localization, and captioning APIs.
This project integrates DINO-X's advanced image analysis capabilities into large language models (LLMs), allowing them to "see" and understand images. It takes an image and optional text prompts as input, and outputs detailed information about objects within the image, including their categories, counts, locations, and descriptive captions. This tool is designed for developers building multimodal AI applications that require rich visual context from images.
113 stars.
Use this if you are a developer building an AI application or agent that needs to understand and describe the contents of images, perform object counting, or locate specific features within an image.
Not ideal if you are looking for a standalone end-user application for image editing or simple photo organization without integrating it into a larger AI system.
Stars
113
Forks
11
Language
TypeScript
License
Apache-2.0
Category
Last pushed
Oct 28, 2025
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/mcp/IDEA-Research/DINO-X-MCP"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
shinpr/mcp-image
MCP server for AI image generation and editing with automatic prompt optimization and quality...
ifmelate/mcp-image-extractor
MCP server which allow LLM in agent mode to analyze image whenever it needs
joenorton/comfyui-mcp-server
lightweight Python-based MCP (Model Context Protocol) server for local ComfyUI
raveenb/fal-mcp-server
MCP server for Fal.ai - Generate images, videos, music and audio with Claude
glifxyz/glif-mcp-server
Easily run glif.app AI workflows inside your LLM: image generators, memes, selfies, and more....