OpenSQZ/MegatronApp

Toolchain built around the Megatron-LM for Distributed Training

42
/ 100
Emerging

When training large language models with Megatron-LM across many GPUs, this toolchain helps optimize performance and understand what's happening inside the model. It takes your Megatron-LM training configuration and outputs visualizations, performance insights, and diagnostics to pinpoint slowdowns. AI/ML engineers and researchers working with distributed model training are the primary users.

Use this if you are training large language models with Megatron-LM and need to diagnose performance bottlenecks, optimize resource usage, or gain real-time visual insights into the model's internal workings.

Not ideal if you are working with smaller models, single-GPU training, or a different distributed training framework.

large-language-models distributed-training model-performance AI-research deep-learning-operations
No Package No Dependents
Maintenance 10 / 25
Adoption 9 / 25
Maturity 15 / 25
Community 8 / 25

How are scores calculated?

Stars

90

Forks

5

Language

Python

License

Last pushed

Mar 05, 2026

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/llm-tools/OpenSQZ/MegatronApp"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.