kyegomez/VisionDatasets

Open-source scripts to create large-scale, richly detailed datasets for multi-modal models

/ 100

Experimental

This project helps create highly detailed, large-scale image and video datasets for training advanced AI models. It takes raw images and videos and processes them into structured datasets suitable for improving computer vision and multi-modal AI systems. It is aimed at AI researchers, machine learning engineers, and data scientists working on visual AI applications.

No commits in the last 6 months.

Use this if you need to build comprehensive, high-quality datasets to train or fine-tune AI models that understand and generate content across data types such as images and text.

Not ideal if you are looking for pre-built, ready-to-use datasets rather than tools to create your own from scratch.

AI training, machine learning datasets, computer vision, data curation, AI research

Stale 6m, No Package, No Dependents

Maintenance 0 / 25

Adoption 5 / 25

Maturity 16 / 25

Community 0 / 25


Stars

Forks

—

Language: Python

License: MIT

Higher-rated alternatives

rom1504/img2dataset

Easily turn large sets of image urls to an image dataset. Can download, resize and package 100M...

devrimcavusoglu/pybboxes

Light weight toolkit for bounding boxes providing conversion between bounding box types and...

PyRetri/PyRetri

Open source deep learning based unsupervised image retrieval toolbox built on PyTorch🔥

Particle1904/DatasetHelpers

Dataset Helper program to automatically select, re scale and tag Datasets (composed of image and...

salesforce/LAVIS

LAVIS - A One-stop Library for Language-Vision Intelligence
