ProGamerGov/VLM-Captioning-Tools

Python scripts to use for captioning images with VLMs

26
/ 100
Experimental

This tool helps automate the process of describing large collections of images using AI. You provide it with one or more folders of images, and it outputs a structured file containing detailed and short textual descriptions for each image. This is ideal for researchers, content managers, or anyone needing to categorize or search through many images.

No commits in the last 6 months.

Use this if you need to automatically generate comprehensive captions for thousands or millions of images.

Not ideal if you only have a few images to caption or if you need highly specialized captions requiring human expert knowledge.

image-cataloging digital-asset-management visual-content-analysis data-labeling media-archive
Stale 6m No Package No Dependents
Maintenance 2 / 25
Adoption 8 / 25
Maturity 16 / 25
Community 0 / 25

How are scores calculated?

Stars

45

Forks

Language

Python

License

MIT

Last pushed

Apr 23, 2025

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/transformers/ProGamerGov/VLM-Captioning-Tools"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.