Linda5823/Magic-Point-to-Read-V3

🪄 Magic Point-to-Read: An interactive AI reading assistant using Google Gemini (Vision OCR & TTS) to turn any image into clickable, audible learning material. 一个利用 Gemini 实现的交互式点读笔，支持图片识别、翻译与语音朗读。

/ 100

Experimental

No Package No Dependents

Maintenance 10 / 25

Adoption 0 / 25

Maturity 11 / 25

Community 0 / 25

How are scores calculated?

Stars

—

Forks

—

Language

TypeScript

License

MIT

Category

vision-agent-platforms

Last pushed

Feb 10, 2026

Commits (30d)

GitHub

Vision Agent Platforms · 36 agents

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/agents/Linda5823/Magic-Point-to-Read-V3"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.

Higher-rated alternatives

GetStream/Vision-Agents

Open Vision Agents by Stream. Build Vision Agents quickly with any model or video provider. Uses...

video-db/videodb-capture-quickstart

Give your agents real time desktop perception. Stream screen, microphone, and system audio for...

sijeeshmiziha/visionagent

Multi-provider AI agent framework with vision capabilities and tool calling. Supports OpenAI,...

grctest/g3n-fastapi-webcam-docker

Utilizing multiple Gemma 3n agents to analyze webcam footage

leukaemiamedtech/hias-tassai-facial-recognition

HIAS TassAI Facial Recognition Agent processes streams from local or remote cameras to identify...

Explore AI Agents

All categories Trending AI Agent directory Insights