Agora-Lab-AI/OmniByteGPT

An implementation of an all-new foundation-model architecture that trains on byte sequences from multiple modalities to handle omni-modal generation of text, video, images, and more.

Score: 31 / 100 (Emerging)

This project helps AI researchers and advanced practitioners create and manipulate diverse digital content using a single model. It takes in raw byte sequences, which can represent anything from text to video, and generates new byte sequences that can be interpreted as various content types. It's designed for those who need a unified approach to generate and transform text, images, audio, and other data without relying on separate, specialized models for each.
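
The core idea, reducing every modality to a shared 256-symbol byte vocabulary, can be illustrated with a toy model. The sketch below is a minimal stand-in, not OmniByteGPT's actual architecture or API; the TinyByteLM class, its dimensions, and the PyTorch layers it uses are assumptions for illustration only.

import torch
import torch.nn as nn

VOCAB = 256  # one token per possible byte value; shared by every modality

class TinyByteLM(nn.Module):
    # Toy decoder-style byte model (hypothetical; not OmniByteGPT's design).
    def __init__(self, d_model=64, n_heads=4, n_layers=2):
        super().__init__()
        self.embed = nn.Embedding(VOCAB, d_model)
        layer = nn.TransformerEncoderLayer(d_model, n_heads, batch_first=True)
        self.blocks = nn.TransformerEncoder(layer, n_layers)
        self.head = nn.Linear(d_model, VOCAB)  # next-byte logits

    def forward(self, byte_ids):
        # Causal mask so each position only attends to earlier bytes.
        mask = nn.Transformer.generate_square_subsequent_mask(byte_ids.size(1))
        return self.head(self.blocks(self.embed(byte_ids), mask=mask))

# Text, images, audio, video: all enter as raw bytes (integers 0..255).
text_ids = torch.tensor([list("hello".encode("utf-8"))])
logits = TinyByteLM()(text_ids)  # shape (1, 5, 256): a distribution per byte

In this framing, training minimizes next-byte cross-entropy over mixed-modality byte streams, and generation samples one byte at a time, with the resulting sequence interpreted through its container format.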

Use this if you are developing new foundation models and need a truly universal architecture for generating content across multiple modalities directly from raw byte data.

Not ideal if you are looking for an off-the-shelf solution for single-modality content creation or if you are not comfortable working with byte-level data.

Tags: AI-research, multi-modal-generation, foundation-models, universal-data-processing, generative-AI
No package published · No dependents
Maintenance: 10 / 25
Adoption: 5 / 25
Maturity: 16 / 25
Community: 0 / 25
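
The four sub-scores appear to sum to the headline score: 10 + 5 + 16 + 0 = 31, matching the 31 / 100 above.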

Stars: 9
Forks:
Language: Python
License: MIT
Last pushed: Mar 09, 2026
Commits (30d): 0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/transformers/Agora-Lab-AI/OmniByteGPT"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
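
For programmatic use, the same endpoint can be called from Python. A minimal sketch, assuming the endpoint returns a JSON body as the curl example suggests (field names are not documented here):

import requests

url = ("https://pt-edge.onrender.com/api/v1/quality/"
       "transformers/Agora-Lab-AI/OmniByteGPT")
resp = requests.get(url, timeout=10)
resp.raise_for_status()  # surface HTTP errors (e.g. rate limiting)
print(resp.json())       # assumes JSON; inspect resp.text otherwise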