krasserm/perceiver-io
A PyTorch implementation of Perceiver, Perceiver IO and Perceiver AR with PyTorch Lightning scripts for distributed training
This project provides advanced artificial intelligence models that can understand and generate various types of data like video, audio, and text. It takes in complex, unstructured inputs, processes them, and delivers useful outputs such as predicted video movements (optical flow) or generated musical sequences. This is ideal for AI researchers and machine learning engineers who need flexible and powerful models for multimodal data tasks.
518 stars. No commits in the last 6 months. Available on PyPI.
Use this if you are a machine learning engineer or AI researcher looking to implement or train state-of-the-art AI models capable of processing and generating diverse data types like video, audio, or text.
Not ideal if you are a practitioner looking for a ready-to-use, no-code solution or a non-technical user who needs a simple application for a specific task.
Stars
518
Forks
46
Language
Python
License
Apache-2.0
Category
Last pushed
Jan 02, 2024
Commits (30d)
0
Dependencies
10
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/ml-frameworks/krasserm/perceiver-io"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Related frameworks
open-mmlab/mmengine
OpenMMLab Foundational Library for Training Deep Learning Models
Xilinx/brevitas
Brevitas: neural network quantization in PyTorch
google/qkeras
QKeras: a quantization deep learning library for Tensorflow Keras
fastmachinelearning/qonnx
QONNX: Arbitrary-Precision Quantized Neural Networks in ONNX
tensorflow/model-optimization
A toolkit to optimize ML models for deployment for Keras and TensorFlow, including quantization...