Rumeysakeskin/Image-Captioning
Image captioning with a benchmark of CNN-based encoder and GRU-based inject-type (init-inject, pre-inject, par-inject) and merge decoder architectures
This project helps researchers and developers explore different ways to automatically describe what's in an image using text. You input an image, and it outputs several potential captions. This is useful for anyone working on computer vision or natural language generation tasks.
No commits in the last 6 months.
Use this if you are a researcher or developer prototyping or benchmarking different image captioning model architectures, particularly those focused on how image features are integrated into text generation.
Not ideal if you need a ready-to-use, high-performance image captioning system for immediate deployment in a production environment without further development.
Stars
8
Forks
—
Language
Python
License
—
Category
Last pushed
Oct 06, 2023
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/nlp/Rumeysakeskin/Image-Captioning"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
ntrang086/image_captioning
generate captions for images using a CNN-RNN model that is trained on the Microsoft Common...
fregu856/CS224n_project
Neural Image Captioning in TensorFlow.
vacancy/SceneGraphParser
A python toolkit for parsing captions (in natural language) into scene graphs (as symbolic...
ltguo19/VSUA-Captioning
Code for "Aligning Linguistic Words and Visual Semantic Units for Image Captioning", ACM MM 2019
Abdelrhman-Yasser/video-content-description
Video content description model for generating descriptions for unconstrained videos