INK-USC/sparse-distillation

Code for "Sparse Distillation: Speeding Up Text Classification by Using Bigger Student Models"

/ 100

Emerging

This project helps data scientists and machine learning engineers speed up text classification tasks. It takes large, pre-trained language models and unlabeled text data to produce a smaller, faster model that performs text classification with high accuracy. This is ideal for teams needing to deploy text classifiers efficiently without sacrificing performance.

No commits in the last 6 months.

Use this if you need to classify text data quickly and accurately, and have access to both labeled examples and a large corpus of unlabeled text.

Not ideal if you don't have a pre-trained RoBERTa model or a substantial amount of unlabeled text data to leverage for the distillation process.

text-classification natural-language-processing sentiment-analysis machine-learning-operations model-optimization

Stale 6m No Package No Dependents

Maintenance 0 / 25

Adoption 5 / 25

Maturity 16 / 25

Community 14 / 25

How are scores calculated?

Stars

Forks

Language

Python

License

MIT

Higher-rated alternatives

airaria/TextBrewer

A PyTorch-based knowledge distillation toolkit for natural language processing

sunyilgdx/NSP-BERT

The code for our paper "NSP-BERT: A Prompt-based Zero-Shot Learner Through an Original...

kssteven418/LTP

[KDD'22] Learned Token Pruning for Transformers

princeton-nlp/CoFiPruning

[ACL 2022] Structured Pruning Learns Compact and Accurate Models https://arxiv.org/abs/2204.00408

georgian-io/Transformers-Domain-Adaptation

:no_entry: [DEPRECATED] Adapt Transformer-based language models to new text domains

Explore NLP Tools

All categories Trending NLP directory Insights