jshilong/GPT4RoI

(ECCVW 2025)GPT4RoI: Instruction Tuning Large Language Model on Region-of-Interest

41
/ 100
Emerging

This project helps anyone who needs to understand specific details within images by allowing you to point out regions of interest and ask questions about them. You input an image and highlight specific areas, then ask questions using natural language. The output is a descriptive answer focusing solely on the selected region(s), making it ideal for visual content analysis. This is for professionals like image analysts, researchers, or content moderators.

551 stars. No commits in the last 6 months.

Use this if you need to ask precise, natural language questions about specific parts of an image and get targeted answers.

Not ideal if you need a general image caption or overall scene description without specifying regions of interest.

image-analysis visual-question-answering content-moderation research-imaging document-annotation
Stale 6m No Package No Dependents
Maintenance 2 / 25
Adoption 10 / 25
Maturity 16 / 25
Community 13 / 25

How are scores calculated?

Stars

551

Forks

28

Language

Python

License

Last pushed

Jun 03, 2025

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/transformers/jshilong/GPT4RoI"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.