ManifoldRG/NEKO
Implementation of GATO style Generalist Multimodal model capable of image, text, RL and Robotics tasks
This project offers tools to train a single AI model that can learn from and interact with various types of data and tasks, like understanding text, recognizing images, and controlling robots. It takes in diverse datasets such as written language, visual inputs, and robot movement logs, and produces a versatile AI capable of performing across these different domains. Researchers and advanced AI developers working on generalist AI would use this.
No commits in the last 6 months.
Use this if you are a researcher or AI developer working on building or experimenting with 'generalist' AI models that can handle a wide range of tasks and data types.
Not ideal if you need a pre-trained, ready-to-use solution for a specific application without advanced AI development.
Stars
45
Forks
11
Language
Python
License
GPL-3.0
Category
Last pushed
Jun 19, 2024
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/ml-frameworks/ManifoldRG/NEKO"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
open-mmlab/mmpretrain
OpenMMLab Pre-training Toolbox and Benchmark
facebookresearch/mmf
A modular framework for vision & language multimodal research from Facebook AI Research (FAIR)
HuaizhengZhang/Awsome-Deep-Learning-for-Video-Analysis
Papers, code and datasets about deep learning and multi-modal learning for video analysis
KaiyangZhou/pytorch-vsumm-reinforce
Unsupervised video summarization with deep reinforcement learning (AAAI'18)
adambielski/siamese-triplet
Siamese and triplet networks with online pair/triplet mining in PyTorch