Hasan8123/Multimodal-AI-Agent
Multimodal AI Agent is a Streamlit application that uses Gemini 2.0 Flash to analyze uploaded video content and combines that analysis with real-time web research. Users upload a video and receive answers that draw on both the video's visual content and live information from the internet.
Stars: —
Forks: —
Language: Python
License: —
Category:
Last pushed: Feb 23, 2026
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/agents/Hasan8123/Multimodal-AI-Agent"
Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000 requests/day.
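The same request can be made from Python. The following is a minimal sketch using only the standard library, not an official client: the helper names `build_quality_url` and `fetch_quality` are hypothetical, and it assumes the endpoint returns a JSON body, which this page does not confirm. How an API key is supplied is not stated here, so the sketch covers only the keyless tier.

```python
import json
import urllib.request
from urllib.parse import quote

# Base path taken from the curl example above.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality/agents"


def build_quality_url(owner: str, repo: str) -> str:
    """Build the endpoint URL for one repository (hypothetical helper).

    Each path segment is percent-encoded so unusual characters in an
    owner or repository name cannot alter the URL structure.
    """
    return f"{BASE_URL}/{quote(owner, safe='')}/{quote(repo, safe='')}"


def fetch_quality(owner: str, repo: str, timeout: float = 10.0):
    """Fetch the record for one repository (hypothetical helper).

    Assumption: the response body is JSON. The page does not document
    the response schema, so the decoded value is returned unmodified.
    """
    url = build_quality_url(owner, repo)
    with urllib.request.urlopen(url, timeout=timeout) as resp:
        return json.loads(resp.read().decode("utf-8"))


if __name__ == "__main__":
    # Printing the URL needs no network access; call fetch_quality()
    # to perform the actual request.
    print(build_quality_url("Hasan8123", "Multimodal-AI-Agent"))
```

Because the keyless tier allows 100 requests/day, it may be worth caching the result of `fetch_quality` rather than calling it repeatedly for the same repository.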
Higher-rated alternatives
alvinreal/awesome-autoresearch
A curated list of autonomous improvement loops, research agents, and autoresearch-style systems...
0xSteph/pentest-ai-agents
Turn Claude Code into your offensive security research assistant. Specialized AI subagents for...
wulinteousa2-hash/napari-chat-assistant
A local agent architecture for semantic-aware interaction between large language models and...
saksham-jain177/AI-Agent-based-Deep-Research
Deep Research AI Agent is a dual-agent system that conducts web-based research and generates...
theam/limina
Autonomous research harness for AI agents. Give it a measurable goal — it hypothesizes,...