InternLM/xtuner
A Next-Generation Training Engine Built for Ultra-Large MoE Models
XTuner V1 helps machine learning engineers and researchers efficiently train ultra-large AI models, particularly those with Mixture-of-Experts (MoE) architectures. Given large datasets and a model configuration, it produces highly optimized trained models. It is aimed at cutting-edge AI research and at training state-of-the-art large language models.
5,096 stars. Actively maintained with 66 commits in the last 30 days. Available on PyPI.
Use this if you need to train ultra-large-scale AI models, especially MoE architectures, and require highly efficient training on extensive datasets and long sequences.
Not ideal if you are working with smaller AI models or do not have access to advanced GPU or NPU hardware for large-scale distributed training.
Stars
5,096
Forks
405
Language
Python
License
Apache-2.0
Category
Last pushed
Mar 13, 2026
Commits (30d)
66
Dependencies
15
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/llm-tools/InternLM/xtuner"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Related tools
AmanPriyanshu/GPT-OSS-MoE-ExpertFingerprinting
ExpertFingerprinting: Behavioral Pattern Analysis and Specialization Mapping of Experts in...
arm-education/Advanced-AI-Mixture-of-Experts
Hands-on course materials for ML engineers to implement and optimize Mixture of Experts models:...
SuperBruceJia/Awesome-Mixture-of-Experts
Awesome Mixture of Experts (MoE): A Curated List of Mixture of Experts (MoE) and Mixture of...
sumitdotml/moe-emergence
a project highlighting the emergent expert specialization in Mixture of Experts (MoEs) across 3...
iahuang/cosmoe
Enabling inference of large mixture-of-experts (MoE) models on Apple Silicon using dynamic offloading.