dolphin-creator/VideoContext-Engine
Local Video RAG Engine. A FastAPI microservice for video understanding: Scene Detection + Whisper ASR + Qwen3-VL. Optimized for Apple Silicon (MLX) & Windows/Linux (Llama.cpp).
This tool helps video analysts, content creators, and researchers understand video content by automatically detecting scenes, transcribing audio, and describing visuals. You input a video file or URL, and it outputs a structured JSON report or plain text summary with detailed scene descriptions, audio transcripts, and overall video context. It's designed for anyone needing deep insights from video without relying on external cloud services.
Use this if you need to extract detailed, structured information from videos, including visual descriptions and audio transcripts, and want to perform this analysis entirely on your local machine.
Not ideal if you require real-time video processing for extremely long videos on Windows/Linux with default settings, or if you prefer a fully cloud-based, managed solution.
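As a rough illustration of the structured report described above (per-scene visual descriptions and audio transcripts plus overall video context), here is a minimal sketch of how such output could be modeled. The class names `Scene` and `VideoReport` and all field names are hypothetical and chosen only for illustration; they are not taken from the project's actual JSON schema.

```python
import json
from dataclasses import asdict, dataclass, field
from typing import List


@dataclass
class Scene:
    """One detected scene. Field names are hypothetical, not the project's schema."""
    start_s: float            # scene start time in seconds
    end_s: float              # scene end time in seconds
    visual_description: str   # text describing what is visible in the scene
    transcript: str           # speech transcribed from the scene's audio


@dataclass
class VideoReport:
    """Whole-video report: overall context plus a list of scenes (hypothetical layout)."""
    source: str               # input file path or URL
    summary: str              # overall video context
    scenes: List[Scene] = field(default_factory=list)

    def to_json(self) -> str:
        # Structured form: nested dataclasses become nested JSON objects.
        return json.dumps(asdict(self), indent=2)

    def to_text(self) -> str:
        # Plain-text form: one summary line, then one line per scene.
        lines = [f"Summary: {self.summary}"]
        for i, s in enumerate(self.scenes, 1):
            lines.append(
                f"[{i}] {s.start_s:.1f}s-{s.end_s:.1f}s | "
                f"{s.visual_description} | {s.transcript}"
            )
        return "\n".join(lines)


if __name__ == "__main__":
    report = VideoReport(
        source="demo.mp4",
        summary="A short demo.",
        scenes=[Scene(0.0, 4.5, "A person waves.", "Hello!")],
    )
    print(report.to_json())
    print(report.to_text())
```

Keeping one record per scene makes it straightforward to emit either the JSON report or the plain-text summary from the same data.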
Stars: 26
Forks: 2
Language: Python
License: —
Category:
Last pushed: Dec 04, 2025
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/rag/dolphin-creator/VideoContext-Engine"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
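The same request can be made from Python using only the standard library. This is a minimal sketch under two assumptions that the page does not confirm: that the URL path follows the pattern category/owner/repo (inferred from the single curl example above), and that the endpoint returns a JSON body. The function names `build_quality_url` and `fetch_quality` are hypothetical helpers.

```python
import json
import urllib.request
from urllib.parse import quote

# Base path copied from the curl example; the segment layout after it is an assumption.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def build_quality_url(category: str, owner: str, repo: str) -> str:
    """Build the endpoint URL, percent-encoding each path segment."""
    parts = [quote(p, safe="") for p in (category, owner, repo)]
    return BASE_URL + "/" + "/".join(parts)


def fetch_quality(category: str, owner: str, repo: str, timeout: float = 10.0):
    """Fetch and decode the response, assuming (unverified) that it is JSON."""
    url = build_quality_url(category, owner, repo)
    with urllib.request.urlopen(url, timeout=timeout) as resp:
        return json.loads(resp.read().decode("utf-8"))


if __name__ == "__main__":
    # Performs a real network request; counts against the daily request limit.
    data = fetch_quality("rag", "dolphin-creator", "VideoContext-Engine")
    print(json.dumps(data, indent=2))
```

Since unauthenticated access is limited to 100 requests per day, it is worth caching responses locally rather than re-fetching on every run.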
Higher-rated alternatives
NVIDIA-AI-Blueprints/video-search-and-summarization
Blueprint for Ingesting massive volumes of live or archived videos and extract insights for...
kaya70875/ytfetcher
⚡ Build structured YouTube datasets at scale — effortlessly fetch transcripts and rich metadata...
HKUDS/VideoRAG
[KDD'2026] "VideoRAG: Chat with Your Videos"
jonaskahn/asktube
AskTube - An AI-powered YouTube video summarizer and QA assistant powered by Retrieval Augmented...
wassim249/YT-Navigator
YT Navigator: AI-powered YouTube content explorer that lets you search and chat with channel...