Image Caption Generation ML Frameworks
Applications and models for automatically generating textual descriptions of images using deep learning architectures (CNNs, RNNs, Transformers). Does NOT include sketch segmentation, image-to-audio conversion, or general object detection without caption output.
There are 70 image caption generation frameworks tracked. The highest-rated is tonybeltramelli/pix2code at 47/100 with 12,051 stars.
Get the projects as JSON (the example request below sets limit=20, so it may not return all 70):
curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=ml-frameworks&subcategory=image-caption-generation&limit=20"
Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000 requests/day.
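The same request can be made from a script. The following is a minimal sketch using only the Python standard library; the endpoint and query parameters are copied from the curl command above, while the function names `build_quality_url` and `fetch_projects` are hypothetical helpers introduced for illustration. The shape of the JSON response is not documented on this page, so the sketch returns the parsed payload as-is and makes no assumption about its fields.

```python
import json
import urllib.request
from urllib.parse import urlencode

# Endpoint taken from the curl example on this page.
BASE_URL = "https://pt-edge.onrender.com/api/v1/datasets/quality"


def build_quality_url(domain="ml-frameworks",
                      subcategory="image-caption-generation",
                      limit=20):
    """Build the request URL (hypothetical helper, not part of any official client)."""
    query = urlencode({"domain": domain, "subcategory": subcategory, "limit": limit})
    return f"{BASE_URL}?{query}"


def fetch_projects(url, timeout=30):
    """Fetch the URL and return the parsed JSON payload unchanged.

    The response schema is an unknown here, so no fields are accessed.
    """
    with urllib.request.urlopen(url, timeout=timeout) as response:
        return json.load(response)


# Example usage (performs a network request, so it is left commented out):
# data = fetch_projects(build_quality_url())
# print(json.dumps(data, indent=2)[:500])
```

Keeping URL construction separate from the network call makes it easy to check the query string without spending any of the daily request quota. How a key is attached to a request is not described on this page, so the sketch only covers keyless access.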
| # | Framework | Score | Tier |
|---|---|---|---|
| 1 | tonybeltramelli/pix2code<br>pix2code: Generating Code from a Graphical User Interface Screenshot | | Emerging |
| 2 | bobbens/sketch_simplification<br>Models and code related to sketch simplification of rough sketches. | | Emerging |
| 3 | emilwallner/Screenshot-to-code<br>A neural network that transforms a design mock-up into a static website. | | Emerging |
| 4 | MiteshPuthran/Image-Caption-Generator<br>The LSTM model generates captions for the input images after extracting... | | Emerging |
| 5 | jchenghu/ExpansionNet_v2<br>Implementation code of the work "Exploiting Multiple Sequence Lengths in... | | Emerging |
| 6 | shagunsodhani/Image-Caption-Generator<br>A simple implementation of neural image caption generator | | Emerging |
| 7 | Y-debug-sys/UCL-sketch<br>[IEEE TKDE] Official Implementation of "Learning-based Sketches for... | | Emerging |
| 8 | val-iisc/sketch-parse<br>Code, demos and data for SketchParse (a neural network for sketch... | | Emerging |
| 9 | aimagelab/camel<br>CaMEL: Mean Teacher Learning for Image Captioning. ICPR 2022 | | Emerging |
| 10 | hlamba28/Automatic-Image-Captioning<br>Generating Captions for images using Deep Learning | | Emerging |
| 11 | ashnkumar/sketch-code<br>Keras model to generate HTML code from hand-drawn website mockups.... | | Emerging |
| 12 | mzbac/sketch2code<br>a simple model that implemented sketch to code | | Emerging |
| 13 | riad5089/Image_Caption_Generator<br>This is a Deep Learning model which uses Computer Vision and NLP to generate... | | Emerging |
| 14 | nasib-ullah/video-captioning-models-in-Pytorch<br>A PyTorch implementation of state of the art video captioning models from... | | Emerging |
| 15 | dhruvik-patel/image-description<br>This repo represents our machine learning project Image Description which is... | | Emerging |
| 16 | llegomark/openai-gpt4-vision<br>This repository contains a simple image captioning app that utilizes... | | Emerging |
| 17 | Dantekk/Image-Captioning<br>Image Captioning using CNN and Transformer. | | Emerging |
| 18 | GvHemanth/Image-to-Speech-Generation_Encoder-Attention-Decoder<br>This project aims to assist visually impaired individuals by providing a... | | Emerging |
| 19 | ChaitanyaC22/Udacity-CVND-Project2-Automated-Image-Captioning<br>This project aims at training a CNN-RNN model to predict captions for a... | | Emerging |
| 20 | nirajankarki5/Flickr30k-Image-Caption-Generator-Using-Deep-Learning<br>A deep learning model that generates descriptions of an image. | | Emerging |
| 21 | ArchAngelAries/TagScribeR<br>A tool to streamline AI image captioning | | Emerging |
| 22 | ammarlodhi255/image-captioning-system-to-assist-the-blind<br>An image captioning system that is able to predict and speak out a caption... | | Emerging |
| 23 | nextml/caption-contest-data<br>Data from the caption contest. | | Emerging |
| 24 | Aryavir07/Image-Captioning-Using-CNN-and-LSTM<br>Generating Captions for images using CNN & LSTM on Flickr8K dataset. The... | | Emerging |
| 25 | qyzdao/Sketch-Based-Deep-Learning<br>A resource repository for sketch based deep learning papers | | Emerging |
| 26 | IEEE-NITK/Image_Captioning<br>Image Captioning is the process of generating textual description of an... | | Experimental |
| 27 | iamirmasoud/image_captioning<br>Automatic Image Captioning using PyTorch on MS COCO dataset | | Experimental |
| 28 | prasadgujar/CapSearch<br>An Image Caption Generation based search | | Experimental |
| 29 | eddisonpham/DynaStride<br>Dynamic Stride Windowing with MMCoT for Multi-Scene Captioning | | Experimental |
| 30 | arunadurai/Eye-For-Blind<br>The aim of this project is to summarize the image using deep learning techniques | | Experimental |
| 31 | nico1008/paint2code<br>Paint2code - a lightweight tool designed to transform your hand-drawn... | | Experimental |
| 32 | iFairPlay22/The-Describer<br>Ecosystem that gives access to image descriptions to the... | | Experimental |
| 33 | AmirhosseinHonardoust/Image-Captioning-CNN-LSTM<br>An end-to-end image captioning project using a CNN encoder (ResNet-50) and... | | Experimental |
| 34 | Arbazkhan-cs/AI-Powered-Image-Captioning<br>🖼️ AI-Powered Image Captioning: Seamlessly generate captions for images... | | Experimental |
| 35 | ayushman72/ImageCaptioning<br>An AI model to caption images | | Experimental |
| 36 | chandana-galgali/Automated-Caption-Generation-using-Encoder-Decoder-Model<br>An end-to-end Computer Vision and NLP project capable of classifying jewelry... | | Experimental |
| 37 | Rumeysakeskin/IMECA<br>Automatic image captioning on Android-based mobile application with CNN and... | | Experimental |
| 38 | parask11/image-captioner<br>Generates suitable captions for the images of people and animals input by the user. | | Experimental |
| 39 | snehalathaArakkonam/Img_CapGenerator<br>Generates captions for images using a CNN encoder and LSTM decoder trained... | | Experimental |
| 40 | amanptl/quote-it<br>Quote It! will be a Software-as-a-Service platform that aims to solve the... | | Experimental |
| 41 | dayyass/image-captioning<br>My solution to the Image Captioning Final Project of the Coursera... | | Experimental |
| 42 | iVishalr/Scene-Describer<br>Video Timestamp recommendation using Transfer Learning and NLP | | Experimental |
| 43 | Aryan0419/Image-Captioning-CNN-LSTM<br>🖼️ Generate descriptive captions for images using a CNN-LSTM model,... | | Experimental |
| 44 | harshwalia36/Audio-Description-of-Image-for-visually-impaired-person<br>Mini Project for Btech which helps the visually impaired person to get the... | | Experimental |
| 45 | VaiBhaVSinGh91/ImageCaption<br>This repository contains an implementation of an image captioning model that... | | Experimental |
| 46 | kr1shnasomani/CaptionCraft<br>Image Captioner using DenseNet201 and LSTM | | Experimental |
| 47 | willyfh/msvd-indonesian<br>MSVD-Indonesian: A Benchmark for Multimodal Video-Text Tasks in Indonesian... | | Experimental |
| 48 | aliahmad552/image-caption-generator-using-deeplearning-nlp<br>This project implements an Image Caption Generator, a deep learning model... | | Experimental |
| 49 | nicolafan/neural-artwork-caption-generator<br>Code for the paper "Exploring the Synergy Between Vision-Language... | | Experimental |
| 50 | ehsan-torabi/Draw2Matrix<br>Draw2Matrix: Draw sketches and instantly convert them into exportable... | | Experimental |
| 51 | Smile040501/image_captioning<br>Generates textual description of any given image. Use both Natural Language... | | Experimental |
| 52 | siddhali24/VISCRIBE-project<br>Visual Describe - Object Detection and Caption Generation Using YOLO | | Experimental |
| 53 | Aditya-ha11/vlm-onnx-comparison<br>Vision-Language Captioning using PyTorch vs ONNX with performance benchmarking | | Experimental |
| 54 | vivek-kumar9/Labelly--Image-Labelling-app-using-CNN-and-LSTMs<br>Image captioning application using a CNN–LSTM encoder–decoder architecture... | | Experimental |
| 55 | paazmaya/sesoko<br>Prepare and caption images for using them as training data | | Experimental |
| 56 | theSohamTUmbare/CAPbot<br>My discord bot that generate the captions for the images | | Experimental |
| 57 | jaychampaneri14/image-captioning<br>CNN-LSTM image captioning with attention mechanism | | Experimental |
| 58 | yashwanthreddytangella-alt/image-captioning-attention<br>Image captioning (ResNet encoder + attention LSTM): data prep, training,... | | Experimental |
| 59 | Amir-Hofo/Image_Captioning<br>Image captioning project using the Flickr8k dataset. A custom... | | Experimental |
| 60 | mahendranandi/Image_Captioning<br>Image captioning using ResNet50 and LSTM in keras library. An application of... | | Experimental |
| 61 | SayamAlt/Image-Caption-Generation-using-ResNet-and-LSTMs<br>Successfully developed an image caption generation model which can precisely... | | Experimental |
| 62 | OmarAtyqy/attention-object-based-captioning<br>Attention Based Image Captioning Using Object Features | | Experimental |
| 63 | Thej-Venkat-P/Image-Caption-Generator<br>Generates Captions for given Images | | Experimental |
| 64 | jman4162/CVND-image-captioning<br>CNN-RNN image captioning project for Udacity's Computer Vision Nanodegree... | | Experimental |
| 65 | SaharshLaud/Image-Captioning-App<br>Deep Learning based application for generating caption for uploaded images. | | Experimental |
| 66 | Pradnya1208/Image-Caption-Generator<br>The aim of this project is to build an Image caption Generator using VGG16... | | Experimental |
| 67 | Jimil1407/image_detection<br>Image Caption Generator - A model that generates descriptive captions for... | | Experimental |
| 68 | jewelryling/jewelry_linguistics<br>Automatic Identification and Description of Jewelry Through Computer Vision... | | Experimental |
| 69 | AHMEDSANA/PaliGemma-flickr8k-finetuning<br>This repository contains code for fine-tuning Google's PaliGemma... | | Experimental |
| 70 | ElsonFilho/Python-AI-Applications<br>Modern AI applications in Python - image captioning, audio transcription... | | Experimental |