Particle1904/DatasetHelpers
Dataset Helper program to automatically select, re scale and tag Datasets (composed of image and text) for Machine Learning training.
This tool helps creative professionals and AI artists prepare image and text datasets for machine learning. It takes folders of images and their accompanying text descriptions (captions or tags) and outputs a refined dataset. This is ideal for anyone working with visual generative AI models who needs to clean, tag, and standardize their training data.
224 stars.
Use this if you need to efficiently process large collections of images and text files, automatically generate descriptive tags, or standardize image dimensions for training AI models.
Not ideal if you only need a basic image viewer or editor without any AI-driven tagging or content-aware cropping functionalities.
Stars
224
Forks
13
Language
C#
License
MIT
Category
Last pushed
Feb 22, 2026
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/ml-frameworks/Particle1904/DatasetHelpers"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
rom1504/img2dataset
Easily turn large sets of image urls to an image dataset. Can download, resize and package 100M...
devrimcavusoglu/pybboxes
Light weight toolkit for bounding boxes providing conversion between bounding box types and...
PyRetri/PyRetri
Open source deep learning based unsupervised image retrieval toolbox built on PyTorch🔥
salesforce/LAVIS
LAVIS - A One-stop Library for Language-Vision Intelligence
haltakov/natural-language-image-search
Search photos on Unsplash using natural language