gicheonkang/sglkt-visdial
🌈 PyTorch Implementation for EMNLP'21 Findings "Reasoning Visual Dialog with Sparse Graph Learning and Knowledge Transfer"
This project provides a method for improving AI models on visual dialog: given image features and dialogue history as input, the model reasons over the conversation to understand questions and produce responses about the image. It is aimed at researchers and practitioners building conversational agents or visual question answering systems.
No commits in the last 6 months.
Use this if you are developing AI models that need to understand and discuss the content of images in a multi-turn conversational format.
Not ideal if you are looking for a ready-to-use application or an end-user product for general visual conversation, as this is a research implementation.
Stars: 13
Forks: 4
Language: Python
License: MIT
Last pushed: Feb 01, 2023
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/nlp/gicheonkang/sglkt-visdial"
Open to everyone: 100 requests/day with no key required, or get a free key for 1,000 requests/day.
Higher-rated alternatives
TheShadow29/awesome-grounding
awesome grounding: A curated list of research papers in visual grounding
microsoft/XPretrain
Multi-modality pre-training
TheShadow29/zsgnet-pytorch
Official implementation of ICCV19 oral paper Zero-Shot grounding of Objects from Natural...
TheShadow29/VidSitu
[CVPR21] Visual Semantic Role Labeling for Video Understanding (https://arxiv.org/abs/2104.00990)
zeyofu/BLINK_Benchmark
This repo contains evaluation code for the paper "BLINK: Multimodal Large Language Models Can...