BAAI-DCAI/DataOptim

A collection of visual instruction tuning datasets.

/ 100

Emerging

This repository provides a streamlined collection of datasets to train Multimodal Large Language Models (MLLMs) more effectively. It takes various image and text question-answering datasets, standardizes them, and outputs ready-to-use training data in a format suitable for MLLM development. Machine learning engineers and researchers working on building or fine-tuning MLLMs would use this.

No commits in the last 6 months.

Use this if you are a machine learning engineer or researcher who needs pre-processed, high-quality visual instruction tuning datasets to train or fine-tune your Multimodal Large Language Models.

Not ideal if you are looking for a model to use directly, as this provides data for training models, not the models themselves.

Multimodal AI Large Language Models Dataset Curation Visual Question Answering Image Captioning

Stale 6m No Package No Dependents

Maintenance 0 / 25

Adoption 9 / 25

Maturity 16 / 25

Community 6 / 25

How are scores calculated?

Stars

Forks

Language

Python

License

MIT

Higher-rated alternatives

RenzeLou/awesome-instruction-learning

Papers and Datasets on Instruction Tuning and Following. ✨✨✨

Explore ML Frameworks

All categories Trending ML Framework directory Insights