Shivanshu-Gupta/Visual-Question-Answering

CNN+LSTM, Attention based, and MUTAN-based models for Visual Question Answering

/ 100

Emerging

This system helps you automatically answer natural language questions about images. You provide an image and a question like "What color is the car?" or "How many people are in the picture?", and it generates a natural language answer based on the image content. This is for researchers or data scientists who need to analyze or extract information from large image collections using textual queries.

No commits in the last 6 months.

Use this if you need to build or experiment with AI models that can interpret visual information from images and provide textual answers to related questions.

Not ideal if you're looking for a ready-to-use, production-grade application for general image search or descriptive captioning, as this is a research-oriented toolkit.

image-analysis question-answering computer-vision natural-language-processing AI-research

No License Stale 6m No Package No Dependents

Maintenance 0 / 25

Adoption 9 / 25

Maturity 8 / 25

Community 19 / 25

How are scores calculated?

Stars

Forks

Language

Python

License

—

Higher-rated alternatives

asahi417/lm-question-generation

Multilingual/multidomain question generation datasets, models, and python library for question...

SparkJiao/SLQA

An Unofficial Pytorch Implementation of Multi-Granularity Hierarchical Attention Fusion Networks...

MurtyShikhar/Question-Answering

TensorFlow implementation of Match-LSTM and Answer pointer for the popular SQuAD dataset.

hsinyuan-huang/FlowQA

Implementation of conversational QA model: FlowQA (with slight improvement)

allenai/aokvqa

Official repository for the A-OKVQA dataset

Explore NLP Tools

All categories Trending NLP directory Insights