lemony-ai/cascadeflow
Cascading runtime for AI agents. Optimize cost, latency, quality, and policy decisions inside the agent loop.
This is a tool for developers building AI agent applications who need to optimize the performance and cost of their agent's decision-making process. It takes your existing AI agent code and allows you to define policies that dynamically select the best AI model for each step, considering factors like cost, speed, and quality. AI application developers, machine learning engineers, and MLOps professionals are the primary users.
294 stars. Available on PyPI.
Use this if you are developing AI agent applications and want to reduce operational costs, improve response times, and maintain high quality by intelligently managing which AI models your agent uses at each step.
Not ideal if you are an end-user of an AI application or if you only need high-level monitoring of AI API calls without needing to influence in-process agent decisions.
Stars
294
Forks
96
Language
Python
License
MIT
Category
Last pushed
Mar 12, 2026
Commits (30d)
0
Dependencies
3
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/agents/lemony-ai/cascadeflow"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Related agents
lobehub/lobehub
The ultimate space for work and life — to find, build, and collaborate with agent teammates that...
Pipelex/pipelex
Declarative language for composable Al workflows. Devtool for agents and mere humans.
strands-agents/sdk-typescript
A model-driven approach to building AI agents in just a few lines of code.
agents-flex/agents-flex
Agents-flex is A Lightweight Java AI Application Development Framework.
JetBrains/koog
Koog is a JVM framework for building predictable, fault-tolerant and enterprise-ready AI agents...