virevolai/logos-shift-client

Replace expensive LLM calls with finetunes automatically

/ 100

Emerging

This helps engineering teams automatically reduce costs and latency for applications that use expensive large language models (LLMs) like GPT or Claude. It observes your existing LLM calls and then automatically trains and deploys cheaper, faster fine-tuned models like Llama or Mistral when they're ready, without you having to manually manage A/B tests or deployments. It's for engineering or product teams building LLM-powered features in production.

No commits in the last 6 months. Available on PyPI.

Use this if you are deploying LLMs in production and want to automatically replace expensive, high-latency API calls with cheaper, faster fine-tuned models without manual intervention.

Not ideal if you prefer to manually manage every step of your model fine-tuning and deployment, or if cost and latency are not primary concerns for your LLM applications.

LLM-operations MLOps-automation cost-optimization production-deployment AI-engineering

Stale 6m

Maintenance 0 / 25

Adoption 8 / 25

Maturity 25 / 25

Community 6 / 25

How are scores calculated?

Stars

Forks

Language

Python

License

MIT

Higher-rated alternatives

Goekdeniz-Guelmez/mlx-lm-lora

Train Large Language Models on MLX.

uber-research/PPLM

Plug and Play Language Model implementation. Allows to steer topic and attributes of GPT-2 models.

VHellendoorn/Code-LMs

Guide to using pre-trained large language models of source code

ssbuild/chatglm_finetuning

chatglm 6b finetuning and alpaca finetuning

jarobyte91/pytorch_beam_search

A lightweight implementation of Beam Search for sequence models in PyTorch.

Explore Transformer Models

All categories Trending Transformer directory Insights