bloomberg/minilmv2.bb
Our open source implementation of MiniLMv2 (https://aclanthology.org/2021.findings-acl.188)
This project helps machine learning engineers and researchers reduce the size and computational cost of large language models while maintaining their performance. It takes a pre-trained large language model (the teacher) and training data as input, and produces a smaller, more efficient 'student' model that is easier to deploy.
No commits in the last 6 months.
Use this if you need to deploy powerful transformer-based language models in resource-constrained environments or accelerate inference times for natural language processing tasks.
Not ideal if you are looking for a pre-trained, ready-to-use small language model without needing to perform a distillation process yourself.
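At its core, MiniLMv2-style distillation trains the student to match the teacher's self-attention relations (pairwise query-query, key-key, and value-value similarity distributions) rather than raw logits. Below is an illustrative sketch of one such relation-matching loss for a single attention head, using NumPy; the function name and shapes are assumptions for demonstration, not the repository's actual API.

```python
import numpy as np

def relation_kl(teacher_vecs, student_vecs, eps=1e-9):
    """KL divergence between self-attention relation distributions,
    in the spirit of MiniLMv2 (illustrative sketch, not the official code).
    Each input has shape (seq_len, head_dim): one head's Q, K, or V vectors."""
    def relations(x):
        # Scaled pairwise dot-product relations, row-wise softmax
        scores = x @ x.T / np.sqrt(x.shape[-1])
        scores -= scores.max(axis=-1, keepdims=True)  # numerical stability
        p = np.exp(scores)
        return p / p.sum(axis=-1, keepdims=True)
    p = relations(teacher_vecs)  # teacher relation distribution (rows sum to 1)
    q = relations(student_vecs)  # student relation distribution
    return float(np.mean(np.sum(p * (np.log(p + eps) - np.log(q + eps)), axis=-1)))

rng = np.random.default_rng(0)
teacher = rng.normal(size=(8, 16))
student = rng.normal(size=(8, 16))
loss = relation_kl(teacher, student)  # minimized during student training
```

In the full method this loss is averaged over relation heads and summed across the Q, K, and V relation types, then minimized by gradient descent on the student's parameters.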
Stars
61
Forks
5
Language
Python
License
Apache-2.0
Category
Last pushed
Jun 12, 2023
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/transformers/bloomberg/minilmv2.bb"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
rasbt/LLMs-from-scratch
Implement a ChatGPT-like LLM in PyTorch from scratch, step by step
facebookresearch/LayerSkip
Code for "LayerSkip: Enabling Early Exit Inference and Self-Speculative Decoding", ACL 2024
FareedKhan-dev/train-llm-from-scratch
A straightforward method for training your LLM, from downloading data to generating text.
kmeng01/rome
Locating and editing factual associations in GPT (NeurIPS 2022)
datawhalechina/llms-from-scratch-cn
Build a large language model from scratch with only basic Python; construct GLM4, Llama3, and RWKV6 step by step from zero to deeply understand how large models work