airaria/TextBrewer

A PyTorch-based knowledge distillation toolkit for natural language processing


Emerging

This toolkit helps machine learning engineers and NLP researchers make large language models run faster and use less memory without a significant loss of accuracy. You provide a high-performing 'teacher' model and a smaller 'student' model, and the toolkit trains the student to reproduce the teacher's behavior so that it approaches the teacher's performance. The output is a compact language model ready for deployment in real-world applications.
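The teacher-student idea can be illustrated with the soft-target loss commonly used in knowledge distillation. The following is a minimal, dependency-free sketch and not TextBrewer's own API: the function names `log_softmax` and `distillation_loss` and the default temperature of 2.0 are assumptions made for illustration. A real setup would compute this on PyTorch tensors over batches and usually combine it with the ordinary task loss.

```python
import math


def log_softmax(logits, temperature=1.0):
    """Return log-probabilities of temperature-scaled logits (numerically stable)."""
    scaled = [x / temperature for x in logits]
    peak = max(scaled)
    # log of the normalizing constant, computed with the max subtracted for stability
    log_z = peak + math.log(sum(math.exp(s - peak) for s in scaled))
    return [s - log_z for s in scaled]


def distillation_loss(student_logits, teacher_logits, temperature=2.0):
    """KL(teacher || student) on temperature-softened distributions.

    Hypothetical helper for illustration. The result is scaled by
    temperature ** 2, a common convention that keeps gradient magnitudes
    comparable across temperatures.
    """
    log_p_teacher = log_softmax(teacher_logits, temperature)
    log_p_student = log_softmax(student_logits, temperature)
    kl = sum(
        math.exp(lt) * (lt - ls)
        for lt, ls in zip(log_p_teacher, log_p_student)
    )
    return kl * temperature ** 2


if __name__ == "__main__":
    teacher = [4.0, 1.0, 0.5]   # made-up logits from a large model
    student = [2.5, 1.5, 0.5]   # made-up logits from a small model
    print(distillation_loss(student, teacher))
```

The loss is zero when the student's logits match the teacher's and grows as the two softened distributions diverge, which is the signal the student is trained to minimize.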

1,697 stars. No commits in the last 6 months.

Use this if you need to deploy large language models for tasks like text classification, question answering, or named entity recognition in environments with limited computational resources or strict latency requirements.

Not ideal if you are working with non-text data or if you need to build a language model from scratch without a larger 'teacher' model to learn from.

natural-language-processing model-optimization machine-learning-deployment text-analytics AI-efficiency

Stale 6m, No Package, No Dependents

Maintenance 0 / 25

Adoption 10 / 25

Maturity 16 / 25

Community 22 / 25


Stars: 1,697

Forks: 246

Language: Python

License: Apache-2.0

Related tools

sunyilgdx/NSP-BERT

The code for our paper "NSP-BERT: A Prompt-based Zero-Shot Learner Through an Original...

kssteven418/LTP

[KDD'22] Learned Token Pruning for Transformers

princeton-nlp/CoFiPruning

[ACL 2022] Structured Pruning Learns Compact and Accurate Models https://arxiv.org/abs/2204.00408

georgian-io/Transformers-Domain-Adaptation

[DEPRECATED] Adapt Transformer-based language models to new text domains

qiangsiwei/bert_distill

BERT distillation (distillation experiments based on BERT)
