ManifoldRG/NEKO

Implementation of a GATO-style generalist multimodal model capable of image, text, RL, and robotics tasks

Emerging

This project provides tools for training a single AI model across multiple data types and tasks, such as understanding text, recognizing images, and controlling robots. It takes in diverse datasets, including written language, visual inputs, and robot movement logs, and produces one model that can perform across these domains. It is aimed at researchers and advanced AI developers working on generalist AI.

No commits in the last 6 months.

Use this if you are a researcher or AI developer working on building or experimenting with 'generalist' AI models that can handle a wide range of tasks and data types.

Not ideal if you need a pre-trained, ready-to-use solution for a specific application and do not plan to do advanced AI development yourself.

generalist-ai multimodal-learning reinforcement-learning natural-language-processing computer-vision

Stale 6m, No Package, No Dependents

Maintenance 0 / 25

Adoption 8 / 25

Maturity 16 / 25

Community 17 / 25

Language: Python

License: GPL-3.0

Higher-rated alternatives

open-mmlab/mmpretrain

OpenMMLab Pre-training Toolbox and Benchmark

facebookresearch/mmf

A modular framework for vision & language multimodal research from Facebook AI Research (FAIR)

HuaizhengZhang/Awsome-Deep-Learning-for-Video-Analysis

Papers, code and datasets about deep learning and multi-modal learning for video analysis

KaiyangZhou/pytorch-vsumm-reinforce

Unsupervised video summarization with deep reinforcement learning (AAAI'18)

adambielski/siamese-triplet

Siamese and triplet networks with online pair/triplet mining in PyTorch
