liangyuwang/Tiny-Megatron

Tiny-Megatron, a minimalistic re-implementation of the Megatron library

Score: 35/100 (Emerging)

This project helps machine learning engineers and researchers understand and implement distributed training strategies for large language models. It takes a PyTorch model and an HPC cluster configuration as input, and outputs a functionally identical model that can be trained efficiently across multiple GPUs or nodes. It's designed for those learning how to scale deep learning models for faster training or to fit larger models into memory.
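
The wrap-and-train flow described above is easiest to see with plain PyTorch data parallelism. The sketch below uses vanilla DistributedDataParallel rather than Tiny-Megatron's own API (which is not documented on this page); launch it with torchrun --nproc_per_node=<num_gpus>.

import os
import torch
import torch.distributed as dist
from torch.nn.parallel import DistributedDataParallel as DDP

def main():
    # torchrun sets RANK, LOCAL_RANK, and WORLD_SIZE in the environment.
    dist.init_process_group(backend="nccl")
    local_rank = int(os.environ["LOCAL_RANK"])
    torch.cuda.set_device(local_rank)

    model = torch.nn.Linear(1024, 1024).cuda(local_rank)
    # Wrapping keeps the model functionally identical to the single-GPU
    # version; gradients are all-reduced across ranks during backward().
    ddp_model = DDP(model, device_ids=[local_rank])
    optimizer = torch.optim.AdamW(ddp_model.parameters(), lr=1e-4)

    x = torch.randn(8, 1024, device=local_rank)
    optimizer.zero_grad()
    loss = ddp_model(x).square().mean()
    loss.backward()
    optimizer.step()

    dist.destroy_process_group()

if __name__ == "__main__":
    main()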

No commits in the last 6 months.

Use this if you are a machine learning engineer or researcher looking to learn about or implement tensor, data, or 2D hybrid parallelism strategies for training large language models in PyTorch.
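
To make the tensor-parallelism part concrete, here is a toy Megatron-style column-parallel linear layer in plain PyTorch. It is an illustrative sketch, not Tiny-Megatron's code: it assumes torch.distributed is already initialized, and a real implementation would wrap the collective in a custom autograd function so gradients flow through it.

import torch
import torch.distributed as dist

class ColumnParallelLinear(torch.nn.Module):
    """Each rank holds one shard of the output columns of the weight."""

    def __init__(self, in_features: int, out_features: int):
        super().__init__()
        world_size = dist.get_world_size()
        assert out_features % world_size == 0
        self.local_out = out_features // world_size
        # Only this rank's slice of the full (out_features, in_features) weight.
        self.weight = torch.nn.Parameter(torch.empty(self.local_out, in_features))
        torch.nn.init.kaiming_uniform_(self.weight)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # Local partial result with shape (..., local_out).
        local_y = x @ self.weight.t()
        # Gather every rank's shard to reassemble the full output.
        # Note: dist.all_gather is not autograd-aware; forward-only sketch.
        shards = [torch.empty_like(local_y) for _ in range(dist.get_world_size())]
        dist.all_gather(shards, local_y)
        return torch.cat(shards, dim=-1)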

Not ideal if you need a production-ready library with advanced features like pipeline parallelism or optimizer state sharding, or if you are not comfortable with PyTorch and distributed training concepts.

distributed-deep-learning large-language-models model-training pytorch hpc-cluster-management
Stale (6m) · No package · No dependents

Maintenance: 2/25
Adoption: 6/25
Maturity: 16/25
Community: 11/25

Stars: 23
Forks: 3
Language: Python
License: Apache-2.0
Last pushed: Sep 01, 2025
Commits (30d): 0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/llm-tools/liangyuwang/Tiny-Megatron"

Open to everyone: 100 requests/day, no key needed. Get a free key for 1,000/day.
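
The same endpoint can also be scripted; a minimal stdlib-only Python sketch is below. It assumes the endpoint returns JSON (the response schema is not documented on this page).

import json
import urllib.request

URL = ("https://pt-edge.onrender.com/api/v1/quality/"
       "llm-tools/liangyuwang/Tiny-Megatron")

with urllib.request.urlopen(URL) as resp:
    # Assumption: the API responds with a JSON document.
    data = json.load(resp)

print(json.dumps(data, indent=2))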