Linda5823/Magic-Point-to-Read-V3
🪄 Magic Point-to-Read: An interactive AI reading assistant using Google Gemini (Vision OCR & TTS) to turn any image into clickable, audible learning material. An interactive point-to-read pen built with Gemini that supports image recognition, translation, and text-to-speech playback.
Stars: —
Forks: —
Language: TypeScript
License: MIT
Category:
Last pushed: Feb 10, 2026
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/agents/Linda5823/Magic-Point-to-Read-V3"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
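The same request can be made from TypeScript. This is a minimal sketch, assuming a runtime with a global `fetch` (Node 18+ or a browser) and assuming the endpoint returns JSON; the response schema is not documented on this page, so the result is typed as `unknown`. The names `API_BASE`, `buildQualityUrl`, and `fetchQuality` are hypothetical helpers introduced for illustration.

```typescript
// Base URL copied from the curl example above.
const API_BASE = "https://pt-edge.onrender.com/api/v1/quality/agents";

// Build the endpoint URL for an owner/repo pair (hypothetical helper).
function buildQualityUrl(owner: string, repo: string): string {
  return `${API_BASE}/${encodeURIComponent(owner)}/${encodeURIComponent(repo)}`;
}

// Fetch the record and return the parsed body (hypothetical helper).
// Assumption: the body is JSON. Its shape is unknown here, hence `unknown`.
async function fetchQuality(owner: string, repo: string): Promise<unknown> {
  const res = await fetch(buildQualityUrl(owner, repo));
  if (!res.ok) {
    throw new Error(`Request failed with status ${res.status}`);
  }
  return res.json();
}
```

Calling `fetchQuality("Linda5823", "Magic-Point-to-Read-V3")` requests the same URL as the curl command. Without a key, the listing states a limit of 100 requests/day, so callers may want to cache results.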
Higher-rated alternatives
GetStream/Vision-Agents
Open Vision Agents by Stream. Build Vision Agents quickly with any model or video provider. Uses...
video-db/videodb-capture-quickstart
Give your agents real time desktop perception. Stream screen, microphone, and system audio for...
sijeeshmiziha/visionagent
Multi-provider AI agent framework with vision capabilities and tool calling. Supports OpenAI,...
grctest/g3n-fastapi-webcam-docker
Utilizing multiple Gemma 3n agents to analyze webcam footage
leukaemiamedtech/hias-tassai-facial-recognition
HIAS TassAI Facial Recognition Agent processes streams from local or remote cameras to identify...