scaleapi/llm-engine
Scale LLM Engine public repository
This tool helps machine learning engineers and data scientists customize and deploy large language models (LLMs) for specific tasks. It takes your raw data and an LLM, then outputs a fine-tuned model ready for use. It's designed for professionals building AI applications who need to adapt powerful language models to their unique datasets.
821 stars. Actively maintained with 23 commits in the last 30 days.
Use this if you need to fine-tune an open-source large language model with your own data and efficiently serve it in production, either via Scale's cloud or your own Kubernetes infrastructure.
Not ideal if you are an end-user looking for a pre-built AI application or do not have experience with machine learning model deployment and infrastructure.
Stars: 821
Forks: 73
Language: Python
License: Apache-2.0
Last pushed: Mar 12, 2026
Commits (30d): 23
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/transformers/scaleapi/llm-engine"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
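The same request can be made from Python. The following is a minimal sketch using only the standard library, not an official client. It assumes the endpoint returns a JSON body (the response schema is not documented here) and that the three path segments in the curl example are a category, a repository owner, and a repository name. The function names build_quality_url and fetch_quality are hypothetical.

```python
import json
import urllib.request
from urllib.parse import quote

# Base path taken from the curl example above.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def build_quality_url(category: str, owner: str, repo: str) -> str:
    """Assemble the endpoint URL in the same shape as the curl example.

    Assumption: the path is <category>/<owner>/<repo>, inferred from
    "transformers/scaleapi/llm-engine".
    """
    # Percent-encode each segment so unusual characters cannot alter the path.
    parts = [quote(part, safe="") for part in (category, owner, repo)]
    return "/".join([BASE_URL, *parts])


def fetch_quality(category: str, owner: str, repo: str, timeout: float = 10.0):
    """Fetch the record for one repository and parse it.

    Assumption: the response body is JSON. Its fields are not specified
    in this listing, so the parsed value is returned unchanged.
    """
    url = build_quality_url(category, owner, repo)
    with urllib.request.urlopen(url, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))
```

Calling fetch_quality("transformers", "scaleapi", "llm-engine") issues the same GET request as the curl command. Without a key, each call counts toward the 100 requests/day limit, so cache results when polling many repositories.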
Related models
AGI-Arena/MARS
The official implementation of MARS: Unleashing the Power of Variance Reduction for Training Large Models
modelscope/easydistill
A toolkit for knowledge distillation of large language models
AGI-Edgerunners/LLM-Adapters
Code for our EMNLP 2023 Paper: "LLM-Adapters: An Adapter Family for Parameter-Efficient...
Wang-ML-Lab/bayesian-peft
Bayesian Low-Rank Adaptation of LLMs: BLoB [NeurIPS 2024] and TFB [NeurIPS 2025]
sangmichaelxie/doremi
PyTorch implementation of DoReMi, a method for optimizing the data mixture weights in language...