yuanze-lin/REVIVE
[NeurIPS 2022] Official code for REVIVE: Regional Visual Representation Matters in Knowledge-Based Visual Question Answering
This project helps researchers and developers in artificial intelligence to improve the performance of systems that answer complex questions about images by leveraging external knowledge. It takes an image and a question as input, and outputs a more accurate answer by focusing on specific regions within the image and their relationships. This is ideal for those working on advanced AI applications requiring deep visual and knowledge-based reasoning.
105 stars. No commits in the last 6 months.
Use this if you are developing or researching AI models that need to answer detailed questions about images by integrating both visual information and external knowledge, and you want to achieve state-of-the-art accuracy.
Not ideal if your task involves only simple image recognition or traditional visual question answering without the need for external, factual knowledge.
Stars
105
Forks
13
Language
Python
License
—
Category
Last pushed
Apr 06, 2025
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/ml-frameworks/yuanze-lin/REVIVE"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
open-mmlab/mmpretrain
OpenMMLab Pre-training Toolbox and Benchmark
facebookresearch/mmf
A modular framework for vision & language multimodal research from Facebook AI Research (FAIR)
adambielski/siamese-triplet
Siamese and triplet networks with online pair/triplet mining in PyTorch
HuaizhengZhang/Awsome-Deep-Learning-for-Video-Analysis
Papers, code and datasets about deep learning and multi-modal learning for video analysis
KaiyangZhou/pytorch-vsumm-reinforce
Unsupervised video summarization with deep reinforcement learning (AAAI'18)