ymrohit/openscenesense-ollama
OpenSceneSense Ollama is a Python library for AI-based video analysis that runs locally. It offers customizable frame and audio analysis for applications in media, education, and content moderation.
This tool helps you understand what's happening in your videos by analyzing both visuals and audio. It takes a video file as input and produces detailed summaries, key visual timelines, and audio transcripts. Anyone who needs to quickly grasp the content of video footage, such as media analysts, educators, or content moderators, would find this useful.
Available on PyPI.
Use this if you need to analyze video content for key events, spoken words, and overall themes, with all processing done privately on your own computer and no dependence on an internet connection.
Not ideal if you need a cloud-based solution for scalability, lack the technical expertise to set up local AI models, or require real-time, ultra-low-latency analysis of video streams.
Stars: 44
Forks: 8
Language: Python
License: MIT
Category:
Last pushed: Jan 05, 2026
Commits (30d): 0
Dependencies: 8
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/computer-vision/ymrohit/openscenesense-ollama"
Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000/day.
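
The curl request above can also be issued from Python. The following is a minimal sketch using only the standard library. It assumes the endpoint returns a JSON body, which this page does not confirm; the helper names `build_quality_url` and `fetch_quality` are hypothetical and chosen only for illustration.

```python
import json
import urllib.request
from urllib.parse import quote

# Base path taken from the curl example on this page.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def build_quality_url(category: str, repo: str) -> str:
    """Build the endpoint URL for a repository given as 'owner/name'."""
    owner, name = repo.split("/", 1)
    return f"{BASE_URL}/{quote(category)}/{quote(owner)}/{quote(name)}"


def fetch_quality(category: str, repo: str, timeout: float = 10.0):
    """Send a GET request and decode the body.

    Assumption: the response is JSON. If it is not, json.loads raises
    a ValueError and the raw body should be inspected instead.
    """
    url = build_quality_url(category, repo)
    with urllib.request.urlopen(url, timeout=timeout) as resp:
        return json.loads(resp.read().decode("utf-8"))
```

Calling `fetch_quality("computer-vision", "ymrohit/openscenesense-ollama")` performs the same request as the curl command. No response fields are shown here because the page does not document the response schema.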