OpenSparseLLMs/LLaMA-MoE-v2

🚀 LLaMA-MoE v2: Exploring Sparsity of LLaMA from Perspective of Mixture-of-Experts with Post-Training

Score: 40 / 100 (Emerging)

This project builds sparse Mixture-of-Experts (MoE) language models from LLaMA3 through post-training, producing models that activate only a subset of their parameters per token and are therefore cheaper to run than their dense counterparts. Data scientists, machine learning engineers, and AI researchers can use it to deploy specialized models at lower compute cost.

No commits in the last 6 months.

Use this if you need to deploy a high-performing language model for specific tasks, but are concerned about the computational cost and resource demands of larger models.

Not ideal if you need an out-of-the-box, general-purpose large language model that doesn't require specialized fine-tuning or model construction.
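
The checkpoints are presumably loaded through the Hugging Face transformers library (the listing's API path files the project under transformers). Below is a minimal, hypothetical loading sketch: the model ID is a placeholder (check the repository README for the actual checkpoint names), and trust_remote_code is an assumption, since MoE variants often ship custom modeling code alongside the weights.

from transformers import AutoModelForCausalLM, AutoTokenizer

# Hypothetical checkpoint ID; see the repository README for real ones.
model_id = "OpenSparseLLMs/LLaMA-MoE-v2-example"

# trust_remote_code is assumed here because the MoE architecture may
# rely on custom modeling code distributed with the checkpoint.
tokenizer = AutoTokenizer.from_pretrained(model_id, trust_remote_code=True)
model = AutoModelForCausalLM.from_pretrained(model_id, trust_remote_code=True)

inputs = tokenizer("Mixture-of-Experts models activate", return_tensors="pt")
outputs = model.generate(**inputs, max_new_tokens=32)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))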

natural-language-processing machine-learning-deployment AI-optimization model-fine-tuning computational-efficiency
Status: Stale (6 months) · No Package · No Dependents

Maintenance: 0 / 25
Adoption: 9 / 25
Maturity: 16 / 25
Community: 15 / 25


Stars: 93
Forks: 13
Language: Python
License: Apache-2.0
Last pushed: Dec 03, 2024
Commits (30d): 0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/transformers/OpenSparseLLMs/LLaMA-MoE-v2"

Open to everyone: 100 requests/day, no key needed. Get a free key for 1,000/day.
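
For scripted access, the same endpoint can be queried from Python. A minimal sketch using only the standard library; the response schema is not documented on this page, so it simply pretty-prints whatever JSON the endpoint returns.

import json
import urllib.request

# Same endpoint as the curl example above.
url = ("https://pt-edge.onrender.com/api/v1/quality/"
       "transformers/OpenSparseLLMs/LLaMA-MoE-v2")

with urllib.request.urlopen(url) as resp:
    data = json.load(resp)  # assumes the endpoint returns JSON

print(json.dumps(data, indent=2))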