HuBoren99/SmartBert

The implementation of SmartBERT: A Promotion of Dynamic Early Exiting Mechanism for Accelerating BERT Inference

34
/ 100
Emerging

This project helps machine learning engineers and researchers speed up how quickly their BERT models can make predictions from text data. It takes a pre-trained BERT model and text datasets (like sentiment analysis or question answering), and outputs a faster-performing BERT model for deployment. This is for anyone deploying large language models who needs to optimize inference speed.

No commits in the last 6 months.

Use this if you are a machine learning engineer or researcher looking to accelerate the inference time of your BERT-based natural language processing models.

Not ideal if you are looking for a pre-trained model for direct use without optimization, or if your primary concern is model training speed rather than inference speed.

natural-language-processing large-language-models model-optimization machine-learning-deployment text-analytics
Stale 6m No Package No Dependents
Maintenance 0 / 25
Adoption 5 / 25
Maturity 16 / 25
Community 13 / 25

How are scores calculated?

Stars

9

Forks

2

Language

Python

License

MIT

Last pushed

Jan 15, 2024

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/nlp/HuBoren99/SmartBert"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.