luchangli03/export_llama_to_onnx

export llama to onnx

Score: 42 / 100 (Emerging)

This tool helps machine learning engineers and MLOps practitioners convert large language models (LLMs) such as LLaMA, Qwen, and ChatGLM to ONNX format for efficient deployment. You provide your existing Hugging Face model files, and it produces optimized ONNX files. It is aimed at anyone who wants to deploy LLMs more efficiently in production environments.

135 stars. No commits in the last 6 months.

Use this if you need to convert trained LLMs (such as LLaMA, Qwen, ChatGLM, Gemma, or Bloom) to ONNX format for faster inference and easier deployment across platforms.

Not ideal if you are looking for a tool to train LLMs or if you do not have a technical understanding of model deployment and ONNX.

Tags: LLM deployment, model optimization, machine learning operations, AI inference, deep learning engineering
Stale 6m · No Package · No Dependents
Maintenance 0 / 25
Adoption 10 / 25
Maturity 16 / 25
Community 16 / 25


Stars: 135
Forks: 18
Language: Python
License: MIT
Last pushed: Dec 28, 2024
Commits (30d): 0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/transformers/luchangli03/export_llama_to_onnx"

Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000 requests/day.
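The same request can be made from Python. The sketch below is illustrative only: the helper names `quality_url` and `fetch_quality` are hypothetical. Treating the `transformers` path segment as a category, and the response body as JSON, are assumptions not confirmed by this page. Only the URL itself comes from the curl example above.

```python
import json
import urllib.request
from urllib.parse import quote

# Base path taken from the curl example; everything after it is
# category/owner/repo (the "category" reading is an assumption).
API_BASE = "https://pt-edge.onrender.com/api/v1/quality"


def quality_url(category: str, owner: str, repo: str) -> str:
    """Build the quality-endpoint URL for one repository (hypothetical helper)."""
    parts = [quote(p, safe="") for p in (category, owner, repo)]
    return API_BASE + "/" + "/".join(parts)


def fetch_quality(category: str, owner: str, repo: str, timeout: float = 10.0):
    """Fetch the quality data and decode it.

    Assumes the endpoint returns a JSON body; the schema is not documented
    here, so the decoded object is returned as-is.
    """
    url = quality_url(category, owner, repo)
    with urllib.request.urlopen(url, timeout=timeout) as resp:
        return json.loads(resp.read().decode("utf-8"))
```

Calling `fetch_quality("transformers", "luchangli03", "export_llama_to_onnx")` issues the request shown in the curl command. Without a key it counts toward the 100 requests/day limit, so cache the result if you poll repeatedly.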