eric-ai-lab/Screen-Point-and-Read

Code repo for "Read Anywhere Pointed: Layout-aware GUI Screen Reading with Tree-of-Lens Grounding"

27
/ 100
Experimental

This project helps anyone who struggles to understand on-screen information, especially those who rely on screen readers. By simply pointing to an area on a digital screen, it provides a clear description of the content in that specific spot, along with how it's organized and related to other elements. This tool is designed for end-users who need to accurately interpret complex or unfamiliar graphical interfaces, enhancing accessibility and comprehension.

No commits in the last 6 months.

Use this if you need to understand specific content on a GUI screen, particularly its layout and spatial relationships, just by pointing to it.

Not ideal if you're looking for a general-purpose screen reader that reads aloud all elements sequentially without specific point-and-read functionality.

digital-accessibility GUI-comprehension screen-reading user-interface-navigation visual-impairment-support
No License Stale 6m No Package No Dependents
Maintenance 0 / 25
Adoption 7 / 25
Maturity 8 / 25
Community 12 / 25

How are scores calculated?

Stars

29

Forks

4

Language

Python

License

Last pushed

Jul 31, 2024

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/agents/eric-ai-lab/Screen-Point-and-Read"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.