elephantmipt/bert-distillation

Distillation of a BERT model with the Catalyst framework

Quality score: 35 / 100 (Emerging)

This project helps machine learning engineers or data scientists compress large BERT-based language models. It takes an existing, well-trained BERT model and training data as input, and outputs a smaller, faster version of that model. This is useful for deploying language models on resource-constrained devices like mobile phones or for speeding up inference in applications.

No commits in the last 6 months.

Use this if you need to reduce the size and increase the inference speed of a BERT-based language model while maintaining most of its performance.

Not ideal if you are a non-technical user or if you require a simple, out-of-the-box solution without custom code.
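The repository's approach is standard knowledge distillation: a small student network is trained to match the temperature-softened output distribution of the large teacher BERT. The sketch below is only an illustration of that objective in plain PyTorch, not code from this repository; the temperature and vocabulary size are assumed values, and the repo's actual Catalyst-based training loop is not reproduced here.

import torch
import torch.nn.functional as F

def distillation_loss(student_logits, teacher_logits, temperature=2.0):
    # Soft-target loss: KL divergence between temperature-scaled
    # teacher and student distributions (illustrative, not the repo's code).
    t = temperature
    soft_teacher = F.softmax(teacher_logits / t, dim=-1)
    log_soft_student = F.log_softmax(student_logits / t, dim=-1)
    # Multiply by T^2 to keep gradient magnitudes comparable to the hard-label loss.
    return F.kl_div(log_soft_student, soft_teacher, reduction="batchmean") * t * t

# Toy example: random logits stand in for teacher/student BERT outputs
# (batch of 8, assumed BERT vocabulary size of 30522).
teacher_logits = torch.randn(8, 30522)
student_logits = torch.randn(8, 30522, requires_grad=True)
loss = distillation_loss(student_logits, teacher_logits)
loss.backward()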

Topics: natural-language-processing, model-compression, deep-learning-deployment, resource-optimization

Status: Stale (no commits in 6 months) · No package published · No dependents
Maintenance: 0 / 25
Adoption: 9 / 25
Maturity: 16 / 25
Community: 10 / 25


Stars: 78
Forks: 7
Language: Python
License: MIT
Last pushed: Jun 12, 2023
Commits (last 30 days): 0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/nlp/elephantmipt/bert-distillation"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
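If you prefer to call the endpoint from Python instead of curl, a minimal sketch using only the standard library is shown below. The response schema is not documented on this page, so the example simply pretty-prints whatever JSON the endpoint returns.

import json
import urllib.request

# Same endpoint as the curl example above; no API key needed up to 100 requests/day.
URL = "https://pt-edge.onrender.com/api/v1/quality/nlp/elephantmipt/bert-distillation"

with urllib.request.urlopen(URL, timeout=10) as resp:
    data = json.load(resp)

# The exact fields are not documented here, so print the full payload.
print(json.dumps(data, indent=2))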