Cadene/vqa.pytorch

Visual Question Answering in Pytorch

43
/ 100
Emerging

This project offers a solution for 'Visual Question Answering' (VQA), a task where a computer answers questions about an image. You provide an image and a question related to its content, and the system delivers a short, factual answer. This is primarily for researchers and developers working on advanced AI for image comprehension and human-to-machine interaction.

735 stars. No commits in the last 6 months.

Use this if you are a researcher or AI developer working on multimodal AI systems and need to train or evaluate models for understanding visual content and answering natural language questions about it.

Not ideal if you are looking for a ready-to-use application or API for general image search or descriptive captioning, as this focuses on the specific VQA research task.

visual question answering multimodal AI computer vision research natural language processing human-computer interaction
No License Stale 6m No Package No Dependents
Maintenance 0 / 25
Adoption 10 / 25
Maturity 8 / 25
Community 25 / 25

How are scores calculated?

Stars

735

Forks

179

Language

Python

License

Last pushed

Dec 11, 2019

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/ml-frameworks/Cadene/vqa.pytorch"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.