houtini-ai/lm

Offload Tasks from Claude to your Local LLM With Houtini-LM - uses OpenAPI for LM Studio and Ollama Compatibility. Save tokens by offloading some grunt work for your API - our tool description helps claude decide what work to assign and why.

/ 100

Emerging

This tool helps software developers reduce their cloud LLM costs when using Claude Code for large programming tasks. It automatically routes routine coding chores, like generating test stubs or commit messages, to a local, free LLM server. Claude continues to handle complex architectural planning, while the local model takes care of the grunt work, saving you significant token expenses.

Use this if you are a software developer frequently using Claude Code for extensive refactoring, code reviews, or generating boilerplate, and want to lower your cloud API costs.

Not ideal if you primarily use Claude for small, quick questions or tasks that don't involve reading and analyzing large code files, as the overhead may outweigh the token savings.

software-development cloud-cost-optimization code-generation developer-tools local-LLM-integration

No Package No Dependents

Maintenance 10 / 25

Adoption 5 / 25

Maturity 11 / 25

Community 15 / 25

How are scores calculated?

Stars

Forks

Language

JavaScript

License

MIT

Related agents

bkrajendra/arduino-llm-agent

An intelligent backend server that uses Large Language Models (LLMs) to generate, compile, and...

imfing/codemode

Let LLMs write code that orchestrates your Python tools in a v8 sandbox

Explore AI Agents

All categories Trending AI Agent directory Insights