krasserm/perceiver-io

A PyTorch implementation of Perceiver, Perceiver IO and Perceiver AR with PyTorch Lightning scripts for distributed training

51
/ 100
Established

This project provides advanced artificial intelligence models that can understand and generate various types of data like video, audio, and text. It takes in complex, unstructured inputs, processes them, and delivers useful outputs such as predicted video movements (optical flow) or generated musical sequences. This is ideal for AI researchers and machine learning engineers who need flexible and powerful models for multimodal data tasks.

518 stars. No commits in the last 6 months. Available on PyPI.

Use this if you are a machine learning engineer or AI researcher looking to implement or train state-of-the-art AI models capable of processing and generating diverse data types like video, audio, or text.

Not ideal if you are a practitioner looking for a ready-to-use, no-code solution or a non-technical user who needs a simple application for a specific task.

AI model development multimodal data processing generative AI computer vision audio synthesis
Stale 6m
Maintenance 0 / 25
Adoption 10 / 25
Maturity 25 / 25
Community 16 / 25

How are scores calculated?

Stars

518

Forks

46

Language

Python

License

Apache-2.0

Last pushed

Jan 02, 2024

Commits (30d)

0

Dependencies

10

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/ml-frameworks/krasserm/perceiver-io"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.