Heratiki/locallama-mcp
An MCP Server that works with Roo Code/Cline.Bot/Claude Desktop to optimize costs by intelligently routing coding tasks between local LLMs, free APIs, and paid APIs.
This tool helps developers reduce the cost of using large language models (LLMs) for coding tasks. It acts as a router: it takes each coding request and decides whether to send it to a free local LLM or to a more expensive cloud-based API. The result is code generation at a lower overall cost, which suits individual developers and small teams managing LLM expenses.
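The routing decision described above can be illustrated with a minimal sketch. This is not the project's implementation: the names `CodingTask`, `RouterConfig`, `routeTask`, and `exampleConfig`, the numeric complexity score, and the threshold values are all hypothetical, chosen only to show the general idea of preferring the cheapest tier that is available and judged adequate for the task.

```typescript
// Hypothetical sketch: none of these names or thresholds come from locallama-mcp.
type Target = "local" | "free-api" | "paid-api";

interface CodingTask {
  prompt: string;
  // Assumed difficulty score in [0, 1]; how it is estimated is out of scope here.
  complexity: number;
}

interface RouterConfig {
  localAvailable: boolean;
  freeApiAvailable: boolean;
  // Tasks at or below this score are assumed safe for a local model.
  localMaxComplexity: number;
  // Tasks at or below this score are assumed safe for a free API.
  freeApiMaxComplexity: number;
}

// Pick the cheapest tier that is available and whose threshold covers the task.
function routeTask(task: CodingTask, cfg: RouterConfig): Target {
  if (cfg.localAvailable && task.complexity <= cfg.localMaxComplexity) {
    return "local";
  }
  if (cfg.freeApiAvailable && task.complexity <= cfg.freeApiMaxComplexity) {
    return "free-api";
  }
  // Fall back to the paid API when no cheaper tier qualifies.
  return "paid-api";
}

const exampleConfig: RouterConfig = {
  localAvailable: true,
  freeApiAvailable: true,
  localMaxComplexity: 0.4,
  freeApiMaxComplexity: 0.7,
};

console.log(routeTask({ prompt: "Rename a variable", complexity: 0.2 }, exampleConfig)); // prints "local"
console.log(routeTask({ prompt: "Refactor a large module", complexity: 0.9 }, exampleConfig)); // prints "paid-api"
```

A real router would likely also weigh token counts, model benchmarks, and per-request pricing; the fixed thresholds here only stand in for that decision.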
No commits in the last 6 months.
Use this if you are a software developer who uses LLMs for coding assistance and you want to lower your monthly API expenses by sending suitable tasks to free local models.
Not ideal if you primarily use LLMs for non-coding tasks, do not have local LLMs set up, or your priority is always maximum output quality regardless of cost.
Stars: 41
Forks: 12
Language: TypeScript
License: —
Category: (none listed)
Last pushed: Jul 25, 2025
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/mcp/Heratiki/locallama-mcp"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
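The same request can be made from TypeScript. The sketch below assumes a runtime with a global `fetch` (for example Node 18 or later), assumes the endpoint returns JSON, and assumes the `owner/repo` path pattern from the curl command above applies to other repositories. The response schema is not documented here, so the result is typed as `unknown`. The helper names `qualityUrl` and `fetchQuality` are illustrative and not part of any published client.

```typescript
// Base path taken from the curl example; treating it as a general
// owner/repo pattern is an assumption.
const BASE_URL = "https://pt-edge.onrender.com/api/v1/quality/mcp";

// Build the request URL for a given repository (hypothetical helper).
function qualityUrl(owner: string, repo: string): string {
  return `${BASE_URL}/${encodeURIComponent(owner)}/${encodeURIComponent(repo)}`;
}

// Fetch the quality record. The response shape is unknown, so callers
// should validate it before reading any fields.
async function fetchQuality(owner: string, repo: string): Promise<unknown> {
  const res = await fetch(qualityUrl(owner, repo));
  if (!res.ok) {
    // With the keyless limit of 100 requests/day, a failure here may
    // mean the quota is exhausted; the exact status code is not documented.
    throw new Error(`Request failed with HTTP ${res.status}`);
  }
  return res.json();
}

// Usage (performs a network request, so it is left commented out):
// fetchQuality("Heratiki", "locallama-mcp").then((data) => console.log(data));
```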
Compare
Higher-rated alternatives
thedaviddias/mcp-llms-txt-explorer
MCP to explore websites with llms.txt files
jonigl/ollama-mcp-bridge
Extend the Ollama API with dynamic AI tool integration from multiple MCP (Model Context...
CodeLogicIncEngineering/codelogic-mcp-server
An MCP Server to utilize Codelogic's rich software dependency data in your AI programming assistant.
sib-swiss/sparql-llm
🦜✨ Chat system, MCP server, and reusable components to improve LLMs capabilities when generating...
webworn/openfoam-mcp-server
LLM-powered OpenFOAM MCP server for intelligent CFD education with Socratic questioning and...