huggingface/optimum-habana
Easy and lightning fast training of 🤗 Transformers on Habana Gaudi processor (HPU)
This project helps machine learning engineers accelerate training and inference of large language models and diffusion models, such as those from the Hugging Face Transformers and Diffusers libraries. It wraps existing model code and configuration so the same workloads run significantly faster on Intel Gaudi AI Accelerators. It is aimed at machine learning practitioners and researchers working with large-scale models who need to optimize performance on this specific hardware.
Use this if you are a machine learning engineer working with Hugging Face models and have access to Intel Gaudi AI Accelerators, and you need to significantly speed up your model training or inference processes.
Not ideal if you do not use Intel Gaudi AI Accelerators or if your workflow does not involve models from the Hugging Face Transformers or Diffusers libraries.
Stars
207
Forks
270
Language
Python
License
Apache-2.0
Category
Last pushed
Mar 12, 2026
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/transformers/huggingface/optimum-habana"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
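The same endpoint can be called from code. A minimal Python sketch, assuming only the URL shown in the curl example above (the response fields and any API-key mechanism are not documented here, so the sketch just fetches and parses the JSON):

```python
import json
import urllib.request

BASE = "https://pt-edge.onrender.com/api/v1/quality"


def quality_url(category: str, owner: str, repo: str) -> str:
    """Build the quality-API URL, mirroring the curl example above."""
    return f"{BASE}/{category}/{owner}/{repo}"


def fetch_quality(category: str, owner: str, repo: str) -> dict:
    """Fetch and decode the JSON payload for a repository.

    The response schema is an assumption; inspect the returned dict
    to see which fields (stars, forks, last pushed, ...) are present.
    """
    with urllib.request.urlopen(quality_url(category, owner, repo)) as resp:
        return json.load(resp)


# Example: the URL for this repository's data
url = quality_url("transformers", "huggingface", "optimum-habana")
```

Within the free tier (100 requests/day without a key) this is enough for occasional lookups; batch consumers would want the keyed 1,000/day tier.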
Related models
openvinotoolkit/nncf
Neural Network Compression Framework for enhanced OpenVINO™ inference
huggingface/optimum
🚀 Accelerate inference and training of 🤗 Transformers, Diffusers, TIMM and Sentence Transformers...
NVIDIA/Megatron-LM
Ongoing research training transformer models at scale
huggingface/optimum-intel
🤗 Optimum Intel: Accelerate inference with Intel optimization tools
eole-nlp/eole
Open language modeling toolkit based on PyTorch