zarzouram/image_captioning_with_transformers

PyTorch implementation of image captioning using a transformer-based model.

38 / 100 (Emerging)

This project helps machine learning practitioners or researchers automatically generate descriptive captions for images. It takes a collection of images and their associated captions as input, processes them, trains a model, and then outputs newly generated text descriptions for unseen images. It is ideal for those working on computer vision and natural language processing tasks.

No commits in the last 6 months.

Use this if you are a machine learning researcher or student who needs to implement, train, and evaluate a transformer-based model for image captioning.

Not ideal if you are looking for a ready-to-use, off-the-shelf application for image captioning without delving into model training and configuration.

image-captioning computer-vision natural-language-processing machine-learning-research deep-learning-models
Stale 6m · No Package · No Dependents
Maintenance 0 / 25
Adoption 8 / 25
Maturity 16 / 25
Community 14 / 25
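
The four category scores above add up to the overall 38 / 100. The page does not state the formula, so the following is only a sketch under the assumption that the overall score is a plain sum of four categories capped at 25 points each; the variable names are illustrative.

```python
# Category scores as listed on this page (each out of 25).
category_scores = {
    "Maintenance": 0,
    "Adoption": 8,
    "Maturity": 16,
    "Community": 14,
}

MAX_PER_CATEGORY = 25  # taken from the "/ 25" shown next to each category

# Assumption: the overall score is the unweighted sum of the categories.
overall = sum(category_scores.values())
overall_max = MAX_PER_CATEGORY * len(category_scores)

print(f"{overall} / {overall_max}")
```

Under that assumption, the zero Maintenance score alone accounts for a quarter of the available points.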

How are scores calculated?

Stars: 68
Forks: 9
Language: Jupyter Notebook
License: MIT
Last pushed: Apr 13, 2023
Commits (30d): 0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/transformers/zarzouram/image_captioning_with_transformers"

Open to everyone: 100 requests/day, no key needed. A free key raises the limit to 1,000 requests/day.
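
For scripted access, the same request can be made from Python with only the standard library. This is a minimal sketch, not documented client code: the helper names `quality_url` and `fetch_quality` are hypothetical, reading the URL path as category/owner/repository is an interpretation of the curl example, and the response is assumed (not confirmed here) to be JSON.

```python
import json
import urllib.request
from urllib.parse import quote

# Base path copied from the curl example above.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def quality_url(category: str, owner: str, repo: str) -> str:
    """Build the endpoint URL. The category/owner/repo split is an assumption."""
    parts = (category, owner, repo)
    return BASE_URL + "/" + "/".join(quote(p, safe="") for p in parts)


def fetch_quality(category: str, owner: str, repo: str, timeout: float = 10.0):
    """Fetch the quality record. Assumes the body is JSON, which is unverified."""
    request = urllib.request.Request(
        quality_url(category, owner, repo),
        headers={"Accept": "application/json"},
    )
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))


# Example call (performs a network request, so it is left commented out):
# data = fetch_quality("transformers", "zarzouram",
#                      "image_captioning_with_transformers")
# print(data)
```

With the keyless limit of 100 requests/day, it is worth caching the result locally rather than calling `fetch_quality` on every run.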