dolphin-creator/VideoContext-Engine

Local Video RAG Engine. A FastAPI microservice for video understanding: Scene Detection + Whisper ASR + Qwen3-VL. Optimized for Apple Silicon (MLX) & Windows/Linux (Llama.cpp).

25 / 100 (Experimental)

This tool helps video analysts, content creators, and researchers understand video content by automatically detecting scenes, transcribing audio, and describing visuals. You input a video file or URL, and it outputs a structured JSON report or plain text summary with detailed scene descriptions, audio transcripts, and overall video context. It's designed for anyone needing deep insights from video without relying on external cloud services.

Use this if you need to extract detailed, structured information from videos, including visual descriptions and audio transcripts, and want to perform this analysis entirely on your local machine.

Not ideal if you require real-time video processing for extremely long videos on Windows/Linux with default settings, or if you prefer a fully cloud-based, managed solution.

video-analysis content-moderation media-research digital-asset-management video-indexing
No License No Package No Dependents
Maintenance 6 / 25
Adoption 7 / 25
Maturity 5 / 25
Community 7 / 25
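
The four category scores above add up to the overall score of 25 / 100. A minimal check, assuming the overall score is a plain sum of four categories worth 25 points each (the page does not state the formula, and the variable names here are illustrative only):

```python
# Category scores as listed on this page; each category is scored out of 25.
category_scores = {
    "Maintenance": 6,
    "Adoption": 7,
    "Maturity": 5,
    "Community": 7,
}
POINTS_PER_CATEGORY = 25  # taken from the "/ 25" shown next to each category

# Assumption: the overall score is the unweighted sum of the categories.
total = sum(category_scores.values())
max_total = POINTS_PER_CATEGORY * len(category_scores)

print(f"{total} / {max_total}")
```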


Stars: 26
Forks: 2
Language: Python
License: None
Last pushed: Dec 04, 2025
Commits (30d): 0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/rag/dolphin-creator/VideoContext-Engine"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
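
The same request can be made from Python. This is a minimal sketch using only the standard library; it assumes the endpoint returns a JSON body, which the page does not document, and the helper names `quality_url` and `fetch_quality` are hypothetical:

```python
import json
import urllib.request
from urllib.parse import quote

# Base path copied from the curl example on this page.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality/rag"


def quality_url(owner: str, repo: str) -> str:
    """Build the quality endpoint URL for a repository."""
    return f"{BASE_URL}/{quote(owner, safe='')}/{quote(repo, safe='')}"


def fetch_quality(owner: str, repo: str, timeout: float = 10.0):
    """Fetch the quality record for a repository.

    Assumption: the response body is JSON. The schema is not described
    on this page, so the parsed value is returned without interpretation.
    """
    with urllib.request.urlopen(quality_url(owner, repo), timeout=timeout) as resp:
        return json.load(resp)


# Example call (performs a network request, so it is left commented out):
# data = fetch_quality("dolphin-creator", "VideoContext-Engine")
# print(json.dumps(data, indent=2))
```

No API key is sent, so calls made this way count against the 100 requests/day keyless limit.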