kyegomez/VisionDatasets
Open-source scripts to create large-scale, richly detailed datasets for multi-modal models
This project helps create highly detailed, large-scale image and video datasets for training advanced AI models. It takes raw images and videos and processes them into structured datasets suitable for improving computer vision and multi-modal AI systems. It is aimed at AI researchers, machine learning engineers, and data scientists working on visual AI applications.
No commits in the last 6 months.
Use this if you need to build comprehensive, high-quality datasets to train or fine-tune AI models that understand and generate content across different data types like images and text.
Not ideal if you are looking for pre-built, ready-to-use datasets rather than tools to create your own from scratch.
Stars: 11
Forks: -
Language: Python
License: MIT
Category:
Last pushed: Mar 11, 2024
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/ml-frameworks/kyegomez/VisionDatasets"
Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000 requests/day.
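For scripted use, the same request as the curl command above can be sketched in Python with only the standard library. This is a minimal sketch, not an official client: the helper names `build_quality_url` and `fetch_quality` are hypothetical, the meaning of the `ml-frameworks` path segment is assumed to be a grouping or category, and the response format is not documented here, so the body is returned as raw text.

```python
from urllib.parse import quote
from urllib.request import urlopen

# Base path copied from the curl example; everything after it is a path segment.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def build_quality_url(group: str, owner: str, repo: str) -> str:
    """Join the three path segments seen in the curl example into a full URL.

    `group` is an assumed name for the segment that reads "ml-frameworks"
    in the example; the source does not say what it represents.
    """
    segments = [quote(part, safe="") for part in (group, owner, repo)]
    return f"{BASE_URL}/{'/'.join(segments)}"


def fetch_quality(url: str, timeout: float = 10.0) -> str:
    """Send a GET request and return the raw response body as text.

    The body is not parsed because the response format is not stated.
    """
    with urlopen(url, timeout=timeout) as response:
        return response.read().decode("utf-8")
```

Calling `fetch_quality(build_quality_url("ml-frameworks", "kyegomez", "VisionDatasets"))` issues one request, which counts toward the daily limit. How a key is supplied is not stated on this page, so the sketch covers only the keyless case.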
Higher-rated alternatives
rom1504/img2dataset
Easily turn large sets of image urls to an image dataset. Can download, resize and package 100M...
devrimcavusoglu/pybboxes
Light weight toolkit for bounding boxes providing conversion between bounding box types and...
PyRetri/PyRetri
Open source deep learning based unsupervised image retrieval toolbox built on PyTorch🔥
Particle1904/DatasetHelpers
Dataset Helper program to automatically select, re scale and tag Datasets (composed of image and...
salesforce/LAVIS
LAVIS - A One-stop Library for Language-Vision Intelligence