cmu-flame/FLAME-MoE
Official repository for FLAME-MoE: A Transparent End-to-End Research Platform for Mixture-of-Experts Language Models
This platform helps AI researchers develop and test Mixture-of-Experts (MoE) language models: it takes raw textual data and configuration settings and produces trained MoE models together with evaluation metrics. It is aimed at AI researchers and machine learning engineers working on advanced language model architectures.
No commits in the last 6 months.
Use this if you are an AI researcher building, training, and evaluating Mixture-of-Experts language models and need a robust, transparent framework for your experiments.
Not ideal if you are looking to simply use a pre-trained language model or fine-tune an existing model without delving into MoE architecture research.
Stars: 33
Forks: 7
Language: Jupyter Notebook
License: —
Category: —
Last pushed: Sep 19, 2025
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/transformers/cmu-flame/FLAME-MoE"
Open to everyone: 100 requests/day with no key needed; a free key raises the limit to 1,000/day.
Higher-rated alternatives
EfficientMoE/MoE-Infinity
PyTorch library for cost-effective, fast, and easy serving of MoE models.
raymin0223/mixture_of_recursions
Mixture-of-Recursions: Learning Dynamic Recursive Depths for Adaptive Token-Level Computation...
AviSoori1x/makeMoE
From scratch implementation of a sparse mixture of experts language model inspired by Andrej...
thu-nics/MoA
[CoLM'25] The official implementation of the paper
jaisidhsingh/pytorch-mixtures
One-stop solutions for Mixture of Expert modules in PyTorch.