krasserm/perceiver-io

A PyTorch implementation of Perceiver, Perceiver IO and Perceiver AR with PyTorch Lightning scripts for distributed training

Established

This project provides PyTorch implementations of the Perceiver, Perceiver IO, and Perceiver AR models, which process and generate data such as video, audio, and text. The models take complex, unstructured inputs and produce task-specific outputs, such as optical flow (predicted motion in video) or generated musical sequences. It suits AI researchers and machine learning engineers who need flexible models for multimodal data tasks.

518 stars. No commits in the last 6 months. Available on PyPI.

Use this if you are a machine learning engineer or AI researcher looking to implement or train Perceiver-family models that process and generate data such as video, audio, or text.

Not ideal if you are a practitioner looking for a ready-to-use, no-code solution or a non-technical user who needs a simple application for a specific task.

AI model development, multimodal data processing, generative AI, computer vision, audio synthesis

Maintenance 0 / 25

Adoption 10 / 25

Maturity 25 / 25

Community 16 / 25

Stars

518

Forks

Language

Python

License

Apache-2.0

Related frameworks

open-mmlab/mmengine

OpenMMLab Foundational Library for Training Deep Learning Models

Xilinx/brevitas

Brevitas: neural network quantization in PyTorch

google/qkeras

QKeras: a quantization deep learning library for Tensorflow Keras

fastmachinelearning/qonnx

QONNX: Arbitrary-Precision Quantized Neural Networks in ONNX

tensorflow/model-optimization

A toolkit to optimize ML models for deployment for Keras and TensorFlow, including quantization...
