inuwamobarak/Image-captioning-ViT

Image Captioning Vision Transformers (ViTs) are transformer models that generate descriptive captions for images by combining the power of Transformers and computer vision. It leverages state-of-the-art pre-trained ViT models and employs technique

/ 100

Experimental

This project helps generate descriptive captions for images, automating a task that typically requires manual observation and typing. You input an image, and it outputs a human-like textual description of what's in the picture. This is useful for anyone working with large collections of images, such as content managers, digital archivists, or e-commerce professionals.

No commits in the last 6 months.

Use this if you need to automatically generate clear, concise text descriptions for a collection of images.

Not ideal if you're looking for a ready-to-use application without any development or coding experience, as this is a developer-focused tool.

image-management content-creation digital-asset-management e-commerce accessibility-enhancement

No License Stale 6m No Package No Dependents

Maintenance 0 / 25

Adoption 7 / 25

Maturity 8 / 25

Community 11 / 25

How are scores calculated?

Stars

Forks

Language

Jupyter Notebook

License

—

Higher-rated alternatives

stevan-milovanovic/LiteRT-for-Android

Image Classification, Image Captioning and LLM inference with LiteRT

ABX9801/Image-Caption-Generator

A Web App to generate caption for Images. VGG-16 Model is used to encode the images and...

ekkonwork/qwen3-vl-autotagger-cli

Standalone CLI for Qwen3-VL auto-tagging with optional XMP embedding.

floydhub/pix2code-template

Build a neural network to code a basic a HTML and CSS website based on a picture of a design mockup.

regiellis/ecko-cli

ecko-cli is a simple CLI tool that streamlines the process of processing images in a directory,...

Explore Generative AI Tools

All categories Trending Generative AI directory Insights