Vision-Agents and visionagent

Both projects are multi-provider vision agent frameworks and compete directly. Vision-Agents, GetStream's production-ready platform built on Stream's edge infrastructure, is far more established (7,366 stars, 46 commits in the last 30 days) than visionagent, a nascent type-safe alternative with 1 star and no commits in the same period.

Metric           Vision-Agents   visionagent
Score            73              36
Status           Verified        Emerging
Maintenance      20/25           10/25
Adoption         10/25           6/25
Maturity         24/25           20/25
Community        19/25           0/25
Stars            7,366           1
Forks            574             -
Downloads        -               107
Commits (30d)    46              0
Language         Python          TypeScript
License          Apache-2.0      MIT
Risk flags       None            None

About Vision-Agents

GetStream/Vision-Agents

Open Vision Agents by Stream. Build Vision Agents quickly with any model or video provider. Uses Stream's edge network for ultra-low latency.

This project helps you build AI assistants that watch and understand live video and audio and respond in real time. It feeds live video and audio streams to AI models to produce insights or interactions, such as real-time coaching or anomaly detection. It is aimed at developers building interactive AI applications that must understand and respond to visual and auditory cues immediately.

real-time video analysis, AI coaching, live monitoring, video analytics, interactive AI experiences

About visionagent

sijeeshmiziha/visionagent

Multi-provider AI agent framework with vision capabilities and tool calling. Supports OpenAI, Anthropic, Google. Built-in Figma tools and Google Stitch integration. Type-safe with Zod validation.

Scores updated daily from GitHub, PyPI, and npm data.