michelecafagna26/VinVL

Original VinVL (and Oscar) repo with API designed for an easy inference

20
/ 100
Experimental

This project helps developers integrate powerful image captioning and scene description capabilities into their applications. It takes visual features extracted from images, along with optional object labels, and generates natural language captions or detailed scene descriptions. This is primarily used by developers who need to add advanced vision-language understanding to their software, such as for content moderation, accessibility features, or automated content generation.

No commits in the last 6 months.

Use this if you are a developer looking for an easy-to-use API to add state-of-the-art image captioning or scene description generation to your Python application.

Not ideal if you are an end-user without programming experience, as this tool requires coding to implement and use.

image-captioning computer-vision natural-language-generation application-development AI-integration
No License Stale 6m No Package No Dependents
Maintenance 0 / 25
Adoption 4 / 25
Maturity 8 / 25
Community 8 / 25

How are scores calculated?

Stars

8

Forks

1

Language

Python

License

Last pushed

Jun 27, 2023

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/transformers/michelecafagna26/VinVL"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.