chasemetoyer/gameplay-vision-llm
Multimodal gameplay video understanding system combining vision, audio, and language models to enable long-horizon reasoning and question-answering over complex game environments.
This project helps video game analysts, testers, and content creators understand complex gameplay by analyzing video and audio footage. It takes raw gameplay video as input and returns detailed answers to natural-language questions about in-game events, player actions, and strategic outcomes. It is designed for anyone who needs deep insight into game performance and mechanics without manual frame-by-frame analysis.
Use this if you need to deeply analyze gameplay videos, understand 'why' certain events happened, track player strategies, or generate detailed summaries of long play sessions.
Not ideal if you're looking for a simple video editing tool or if your primary need is general video content recognition outside of game environments.
Stars: 9
Forks: 1
Language: Python
License: MIT
Category:
Last pushed: Dec 17, 2025
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/rag/chasemetoyer/gameplay-vision-llm"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
llmware-ai/llmware
Unified framework for building enterprise RAG pipelines with small, specialized models
Sinapsis-AI/sinapsis-chatbots
Monorepo of Sinapsis templates supporting LLM-based agents
aimclub/ProtoLLM
Framework for prototyping LLM-based applications
Azure-Samples/azureai-foundry-finetuning-raft
A recipe that will walk you through using either Meta Llama 3.1 405B or OpenAI GPT-4o deployed...
xi029/Qwen3-VL-MoeLORA
Comparative LoRA fine-tuning experiments on Qwen's latest multimodal image-text model, Qwen3-VL-4B-Instruct, deployed with LangChain + RAG + multi-agent orchestration