mit-han-lab/hardware-aware-transformers
[ACL'20] HAT: Hardware-Aware Transformers for Efficient Natural Language Processing
This project helps machine translation practitioners deliver faster, more efficient language models on various devices. You input a machine translation task and the target hardware (like a Raspberry Pi or a powerful server CPU/GPU), and it outputs a specialized, pre-trained translation model optimized for speed and size on that hardware, without sacrificing translation quality. This is ideal for researchers and engineers deploying language models in diverse environments.
336 stars. No commits in the last 6 months.
Use this if you need to deploy high-quality machine translation models that run quickly and efficiently on specific hardware, especially devices with limited resources.
Not ideal if you are looking for a general-purpose, off-the-shelf translation service without specific hardware optimization needs.
Stars
336
Forks
50
Language
Python
License
—
Category
transformers
Last pushed
Jul 14, 2024
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/transformers/mit-han-lab/hardware-aware-transformers"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
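For scripted access, here is a minimal Python sketch of the same request. It assumes only the endpoint shown in the curl example and a JSON response body; the response schema is not documented on this page, so the script prints whatever top-level fields come back rather than assuming their names.

import json
import urllib.request

# Endpoint copied from the curl example above; the unauthenticated
# tier allows 100 requests/day.
URL = ("https://pt-edge.onrender.com/api/v1/quality/"
       "transformers/mit-han-lab/hardware-aware-transformers")

with urllib.request.urlopen(URL, timeout=10) as resp:
    data = json.load(resp)

# The schema is undocumented here, so print every top-level field
# (presumably including stars, forks, language, and last-pushed date).
for key, value in sorted(data.items()):
    print(f"{key}: {value}")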
Higher-rated alternatives
huggingface/transformers-bloom-inference
Fast Inference Solutions for BLOOM
Tencent/TurboTransformers
A fast and user-friendly runtime for transformer inference (BERT, ALBERT, GPT-2, decoders, etc.)...
mit-han-lab/lite-transformer
[ICLR 2020] Lite Transformer with Long-Short Range Attention
LibreTranslate/Locomotive
Toolkit for training/converting LibreTranslate compatible language models 🚂
aliemo/transfomers-silicon-research
Research and materials on hardware implementations of Transformer models