Screen Vision Automation Computer Vision Tools

Tools that use computer vision to analyze screen content and automate user interactions (clicking, typing, gaming actions). Includes real-time visual analysis, OCR-based automation, and AI-driven input simulation. Does NOT include general image segmentation, obstacle detection for accessibility, or tools without screen/visual automation components.

There are 20 screen vision automation tools tracked. 1 score above 70 (verified tier). The highest-rated is MaaXYZ/MaaFramework at 71/100 with 3,445 stars. 1 of the top 10 are actively maintained.

Get all 20 projects as JSON

curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=computer-vision&subcategory=screen-vision-automation&limit=20"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.

# Tool Score Tier
1 MaaXYZ/MaaFramework

基于图像识别的自动化黑盒测试框架 | An automation black-box testing framework based on image...

71
Verified
2 stb-tester/stb-tester

Automated Testing for Set-Top Boxes and Smart TVs

63
Established
3 Villavu/Simba

Simba is a program used to repeat certain (complicated) tasks. Typically...

62
Established
4 xxreflextheone/AI-Aimbot

Open source AI powered aim assist written in Python for all* games.

60
Established
5 STMicroelectronics/meta-st-x-linux-ai

OpenEmbedded meta layer to install AI frameworks and tools for the STM32MPU series

57
Established
6 ai-hpc/ai-hardware-engineer-roadmap

From Kernel-Level Parallel Programming to Custom AI Inference Accelerator...

45
Emerging
7 gabrimatic/eyra

Real-time AI screen analysis from the terminal. Local inference, voice...

44
Emerging
8 Nyx0ra/lol-aram-mayhem-hextech-helper

🎮 基于计算机视觉 (RapidOCR) 的 LOL 大乱斗海克斯助手。自动识别屏幕选项,实时推荐来自 Blitz.gg 的高胜率海克斯。 | LOL...

42
Emerging
9 nicedreamzapp/nicedreamzapp

Building AI tools and learning as I go: mobile computer vision, medical ML,...

39
Emerging
10 kwel1x/Auto_aim

🎯 Capture and analyze visuals in real-time using YOLO, TensorRT, and DXGI...

33
Emerging
11 cflaviu/ai-devbox

GPU-enabled C++ development stack based on NVIDIA DeepStream

32
Emerging
12 aurintex/pai-os

Open-source AI wearable companion. Local-first multimodal perception (VLM &...

30
Emerging
13 karimm-ai/NiceShot_AI

A Python tool powered by computer vision to analyze gameplay videos and...

29
Experimental
14 PRITHIVSAKTHIUR/CUA-GUI-Operator

CUA-GUI-Operator is an experimental, advanced computer-use agent (CUA) and...

29
Experimental
15 ninja-otaku/Project_Aegis

AI gaming companion — screen capture from a separate device, Claude vision analysis

28
Experimental
16 levipereira/deepstream-sahi

Native GStreamer plugins that integrate SAHI (Slicing Aided Hyper Inference)...

26
Experimental
17 lianhuaandy/Brain

🧠 Connect, create, and earn with BRAIN—your social network for paid posts,...

24
Experimental
18 gabi123-cmd/eyes-ios

👁️ Detect obstacles in real-time using LiDAR technology, enhancing awareness...

23
Experimental
19 johsonx88888/Hachiware-Desktop-Pet

An AI-powered desktop pet based on Hachiware,featuring computer vision...

20
Experimental
20 aminethe01/open-typeless

🎤 Enable seamless voice input on macOS with push-to-talk functionality,...

19
Experimental