puneethkotha/Falcon
Production ML inference platform. Multi-worker · Nginx load balancing · idempotency · exponential backoff · Prometheus metrics. Reduced p95 latency 30%.
Stars
—
Forks
—
Language
Python
License
MIT
Category
Last pushed
Mar 17, 2026
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/mlops/puneethkotha/Falcon"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
feast-dev/feast
The Open Source Feature Store for AI/ML
clearml/clearml-serving
ClearML - Model-Serving Orchestration and Repository Solution
lakehq/sail
LakeSail's computation framework with a mission to unify batch processing, stream processing,...
PaddlePaddle/Serving
A flexible, high-performance carrier for machine learning models(『飞桨』服务化部署框架)
SeldonIO/MLServer
An inference server for your machine learning models, including support for multiple frameworks,...