Shenggan/atp
Adaptive Tensor Parallelism for Foundation Models
This project helps machine learning engineers and researchers efficiently train and serve very large AI models, commonly called foundation models. Given an existing model architecture and training setup, it optimizes how computations are partitioned across multiple GPUs or machines using adaptive tensor parallelism. The result is shorter training times and more efficient inference for large models.
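The repository's own API is not shown on this page, so the following is only a minimal NumPy sketch of the underlying idea of tensor parallelism: a layer's weight matrix is split column-wise across devices (plain arrays stand in for GPUs here), each "device" computes a partial output, and the shards are gathered back together. Nothing here reflects atp's actual interface.

```python
import numpy as np

# Conceptual sketch of column-wise tensor parallelism (NOT the atp API):
# the weight matrix of a linear layer is sharded across hypothetical
# devices, each computes a partial result, and concatenation recovers
# the full, unsharded output.

rng = np.random.default_rng(0)
x = rng.standard_normal((4, 8))    # batch of input activations
W = rng.standard_normal((8, 16))   # full weight matrix of a linear layer

# Shard the weights column-wise across two hypothetical devices.
W_shards = np.split(W, 2, axis=1)

# Each device multiplies the same input by its own shard independently.
partial_outputs = [x @ shard for shard in W_shards]

# Gathering (concatenating) the partials reproduces the unsharded matmul.
y_parallel = np.concatenate(partial_outputs, axis=1)
y_full = x @ W
assert np.allclose(y_parallel, y_full)
```

In a real system each shard lives on a different GPU and the gather is a collective communication step; the arithmetic equivalence shown above is what makes the split correct.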
No commits in the last 6 months.
Use this if you are working with extremely large AI models and need to reduce their training or inference time by optimizing how they utilize distributed hardware.
Not ideal if you are working with smaller models or do not have access to a distributed computing environment with multiple GPUs.
Stars: 9
Forks: —
Language: Python
License: MIT
Category:
Last pushed: Dec 15, 2022
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/transformers/Shenggan/atp"
Open to everyone: 100 requests/day with no key required. A free key raises the limit to 1,000/day.
Higher-rated alternatives
AliHaiderAhmad001/GPT-from-Scratch-with-Tensorflow
Implementation for "Improving Language Understanding by Generative Pre-Training" paper
HomebrewML/HomebrewNLP-torch
A case study of efficient training of large language models using commodity hardware.
akshat0123/GPT-1
Pytorch implementation of GPT-1
qiqiApink/MotionGPT
The official PyTorch implementation of the paper "MotionGPT: Finetuned LLMs are General-Purpose...
nawnoes/pytorch-gpt-x
An implementation of an autoregressive language model using an improved Transformer and...