Image Caption Generation ML Frameworks

Applications and models for automatically generating textual descriptions of images using deep learning architectures (CNNs, RNNs, Transformers). Does NOT include sketch segmentation, image-to-audio conversion, or general object detection without caption output.

There are 70 image caption generation frameworks tracked. The highest-rated is tonybeltramelli/pix2code at 47/100 with 12,051 stars.

Get the projects as JSON. Note that the example request sets limit=20 rather than 70.

curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=ml-frameworks&subcategory=image-caption-generation&limit=20"

Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000 requests/day.
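For readers who prefer a script to curl, the same request can be sketched in Python using only the standard library. The helper names `build_url` and `fetch_projects` are hypothetical, the response schema is not documented on this page (so the sketch returns the decoded JSON as-is), and whether the endpoint accepts a `limit` above 20 is an assumption rather than something stated here.

```python
import json
import urllib.parse
import urllib.request

# Endpoint and query parameters copied from the curl example on this page.
BASE_URL = "https://pt-edge.onrender.com/api/v1/datasets/quality"


def build_url(domain="ml-frameworks",
              subcategory="image-caption-generation",
              limit=20):
    """Return the request URL with the query string properly encoded."""
    query = urllib.parse.urlencode(
        {"domain": domain, "subcategory": subcategory, "limit": limit}
    )
    return f"{BASE_URL}?{query}"


def fetch_projects(limit=20, timeout=30):
    """Fetch and decode the JSON response (hypothetical helper).

    The structure of the returned object is not described on this page,
    so no fields are assumed; inspect the result before relying on it.
    """
    with urllib.request.urlopen(build_url(limit=limit), timeout=timeout) as resp:
        return json.load(resp)


if __name__ == "__main__":
    # Printing the URL needs no network access; call fetch_projects() to query.
    print(build_url())
```

Calling `build_url()` with its defaults reproduces the URL from the curl command, and unauthenticated use stays subject to the 100 requests/day limit noted above.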

| # | Framework | Description | Score | Tier |
|---|---|---|---|---|
| 1 | tonybeltramelli/pix2code | pix2code: Generating Code from a Graphical User Interface Screenshot | 47 | Emerging |
| 2 | bobbens/sketch_simplification | Models and code related to sketch simplification of rough sketches. | 47 | Emerging |
| 3 | emilwallner/Screenshot-to-code | A neural network that transforms a design mock-up into a static website. | 46 | Emerging |
| 4 | MiteshPuthran/Image-Caption-Generator | The LSTM model generates captions for the input images after extracting... | 45 | Emerging |
| 5 | jchenghu/ExpansionNet_v2 | Implementation code of the work "Exploiting Multiple Sequence Lengths in... | 45 | Emerging |
| 6 | shagunsodhani/Image-Caption-Generator | A simple implementation of neural image caption generator | 42 | Emerging |
| 7 | Y-debug-sys/UCL-sketch | [IEEE TKDE] Official Implementation of "Learning-based Sketches for... | 42 | Emerging |
| 8 | val-iisc/sketch-parse | Code, demos and data for SketchParse (a neural network for sketch... | 41 | Emerging |
| 9 | aimagelab/camel | CaMEL: Mean Teacher Learning for Image Captioning. ICPR 2022 | 41 | Emerging |
| 10 | hlamba28/Automatic-Image-Captioning | Generating Captions for images using Deep Learning | 41 | Emerging |
| 11 | ashnkumar/sketch-code | Keras model to generate HTML code from hand-drawn website mockups.... | 40 | Emerging |
| 12 | mzbac/sketch2code | a simple model that implemented sketch to code | 39 | Emerging |
| 13 | riad5089/Image_Caption_Generator | This is a Deep Learning model which uses Computer Vision and NLP to generate... | 35 | Emerging |
| 14 | nasib-ullah/video-captioning-models-in-Pytorch | A PyTorch implementation of state of the art video captioning models from... | 35 | Emerging |
| 15 | dhruvik-patel/image-description | This repo represents our machine learning project Image Description which is... | 35 | Emerging |
| 16 | llegomark/openai-gpt4-vision | This repository contains a simple image captioning app that utilizes... | 34 | Emerging |
| 17 | Dantekk/Image-Captioning | Image Captioning using CNN and Transformer. | 34 | Emerging |
| 18 | GvHemanth/Image-to-Speech-Generation_Encoder-Attention-Decoder | This project aims to assist visually impaired individuals by providing a... | 32 | Emerging |
| 19 | ChaitanyaC22/Udacity-CVND-Project2-Automated-Image-Captioning | This project aims at training a CNN-RNN model to predict captions for a... | 31 | Emerging |
| 20 | nirajankarki5/Flickr30k-Image-Caption-Generator-Using-Deep-Learning | A deep learning model that generates descriptions of an image. | 30 | Emerging |
| 21 | ArchAngelAries/TagScribeR | A tool to streamline AI image captioning | 30 | Emerging |
| 22 | ammarlodhi255/image-captioning-system-to-assist-the-blind | An image captioning system that is able to predict and speak out a caption... | 30 | Emerging |
| 23 | nextml/caption-contest-data | Data from the caption contest. | 30 | Emerging |
| 24 | Aryavir07/Image-Captioning-Using-CNN-and-LSTM | Generating Captions for images using CNN & LSTM on Flickr8K dataset. The... | 30 | Emerging |
| 25 | qyzdao/Sketch-Based-Deep-Learning | A resource repository for sketch based deep learning papers | 30 | Emerging |
| 26 | IEEE-NITK/Image_Captioning | Image Captioning is the process of generating textual description of an... | 29 | Experimental |
| 27 | iamirmasoud/image_captioning | Automatic Image Captioning using PyTorch on MS COCO dataset | 29 | Experimental |
| 28 | prasadgujar/CapSearch | An Image Caption Generation based search | 28 | Experimental |
| 29 | eddisonpham/DynaStride | Dynamic Stride Windowing with MMCoT for Multi-Scene Captioning | 27 | Experimental |
| 30 | arunadurai/Eye-For-Blind | The aim of this project is to summarize the image using deep learning techniques | 27 | Experimental |
| 31 | nico1008/paint2code | Paint2code - a lightweight tool designed to transform your hand-drawn... | 27 | Experimental |
| 32 | iFairPlay22/The-Describer | An ecosystem for giving access to image descriptions to the... | 26 | Experimental |
| 33 | AmirhosseinHonardoust/Image-Captioning-CNN-LSTM | An end-to-end image captioning project using a CNN encoder (ResNet-50) and... | 25 | Experimental |
| 34 | Arbazkhan-cs/AI-Powered-Image-Captioning | 🖼️ AI-Powered Image Captioning: Seamlessly generate captions for images... | 24 | Experimental |
| 35 | ayushman72/ImageCaptioning | An AI model to caption images | 23 | Experimental |
| 36 | chandana-galgali/Automated-Caption-Generation-using-Encoder-Decoder-Model | An end-to-end Computer Vision and NLP project capable of classifying jewelry... | 23 | Experimental |
| 37 | Rumeysakeskin/IMECA | Automatic image captioning on Android-based mobile application with CNN and... | 23 | Experimental |
| 38 | parask11/image-captioner | Generates suitable captions for the images of people and animals input by the user. | 22 | Experimental |
| 39 | snehalathaArakkonam/Img_CapGenerator | Generates captions for images using a CNN encoder and LSTM decoder trained... | 22 | Experimental |
| 40 | amanptl/quote-it | Quote It! will be a Software-as-a-Service platform that aims to solve the... | 22 | Experimental |
| 41 | dayyass/image-captioning | My solution to the Image Captioning Final Project of the Coursera... | 22 | Experimental |
| 42 | iVishalr/Scene-Describer | Video Timestamp recommendation using Transfer Learning and NLP | 22 | Experimental |
| 43 | Aryan0419/Image-Captioning-CNN-LSTM | 🖼️ Generate descriptive captions for images using a CNN-LSTM model,... | 22 | Experimental |
| 44 | harshwalia36/Audio-Description-of-Image-for-visually-impaired-person | Mini Project for Btech which helps the visually impaired person to get the... | 21 | Experimental |
| 45 | VaiBhaVSinGh91/ImageCaption | This repository contains an implementation of an image captioning model that... | 21 | Experimental |
| 46 | kr1shnasomani/CaptionCraft | Image Captioner using DenseNet201 and LSTM | 20 | Experimental |
| 47 | willyfh/msvd-indonesian | MSVD-Indonesian: A Benchmark for Multimodal Video-Text Tasks in Indonesian... | 20 | Experimental |
| 48 | aliahmad552/image-caption-generator-using-deeplearning-nlp | This project implements an Image Caption Generator, a deep learning model... | 19 | Experimental |
| 49 | nicolafan/neural-artwork-caption-generator | Code for the paper "Exploring the Synergy Between Vision-Language... | 19 | Experimental |
| 50 | ehsan-torabi/Draw2Matrix | Draw2Matrix - Draw sketches and instantly convert them into exportable... | 18 | Experimental |
| 51 | Smile040501/image_captioning | Generates textual description of any given image. Use both Natural Language... | 17 | Experimental |
| 52 | siddhali24/VISCRIBE-project | Visual Describe - Object Detection and Caption Generation Using YOLO | 17 | Experimental |
| 53 | Aditya-ha11/vlm-onnx-comparison | Vision-Language Captioning using PyTorch vs ONNX with performance benchmarking | 15 | Experimental |
| 54 | vivek-kumar9/Labelly--Image-Labelling-app-using-CNN-and-LSTMs | Image captioning application using a CNN–LSTM encoder–decoder architecture... | 14 | Experimental |
| 55 | paazmaya/sesoko | Prepare and caption images for using them as training data | 14 | Experimental |
| 56 | theSohamTUmbare/CAPbot | My Discord bot that generates captions for images | 13 | Experimental |
| 57 | jaychampaneri14/image-captioning | CNN-LSTM image captioning with attention mechanism | 13 | Experimental |
| 58 | yashwanthreddytangella-alt/image-captioning-attention | Image captioning (ResNet encoder + attention LSTM): data prep, training,... | 13 | Experimental |
| 59 | Amir-Hofo/Image_Captioning | Image captioning project using the Flickr8k dataset. A custom... | 12 | Experimental |
| 60 | mahendranandi/Image_Captioning | Image captioning using ResNet50 and LSTM in keras library. An application of... | 12 | Experimental |
| 61 | SayamAlt/Image-Caption-Generation-using-ResNet-and-LSTMs | Successfully developed an image caption generation model which can precisely... | 12 | Experimental |
| 62 | OmarAtyqy/attention-object-based-captioning | Attention Based Image Captioning Using Object Features | 11 | Experimental |
| 63 | Thej-Venkat-P/Image-Caption-Generator | Generates Captions for given Images | 11 | Experimental |
| 64 | jman4162/CVND-image-captioning | CNN-RNN image captioning project for Udacity's Computer Vision Nanodegree... | 11 | Experimental |
| 65 | SaharshLaud/Image-Captioning-App | Deep Learning based application for generating caption for uploaded images. | 11 | Experimental |
| 66 | Pradnya1208/Image-Caption-Generator | The aim of this project is to build an Image caption Generator using VGG16... | 11 | Experimental |
| 67 | Jimil1407/image_detection | Image Caption Generator - A model that generates descriptive captions for... | 11 | Experimental |
| 68 | jewelryling/jewelry_linguistics | Automatic Identification and Description of Jewelry Through Computer Vision... | 11 | Experimental |
| 69 | AHMEDSANA/PaliGemma-flickr8k-finetuning | This repository contains code for fine-tuning Google's PaliGemma... | 11 | Experimental |
| 70 | ElsonFilho/Python-AI-Applications | Modern AI applications in Python - image captioning, audio transcription... | 10 | Experimental |