GradientHQ/parallax

Parallax is a distributed model serving framework that lets you build your own AI cluster anywhere

57 / 100 (Established)

Setting up and managing large AI models (such as Llama) on your own servers or devices can be complex and expensive. Parallax simplifies this by letting you pool the computing power you already have, even across different machines and locations, and run a model across that pool. You choose a large language model, and Parallax serves its responses to users without specialized, costly infrastructure. It is aimed at organizations and individuals who want to host powerful AI models themselves rather than rely on external cloud providers.

1,152 stars. Actively maintained with 2 commits in the last 30 days.

Use this if you need to run large AI models on your own hardware, distributing the workload across multiple machines for cost-effectiveness and control.

Not ideal if you prefer to use fully managed cloud services for your AI model hosting or only need to run small, less resource-intensive models.

Tags: AI hosting, private cloud, AI model deployment, large language model (LLM) serving, distributed computing
No package, no dependents
Maintenance 13 / 25
Adoption 10 / 25
Maturity 15 / 25
Community 19 / 25
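
The four category scores above add up to the overall 57 / 100 shown at the top of the page. The sketch below reproduces that arithmetic. It assumes the overall score is a plain sum of four categories worth 25 points each, which matches the numbers on this page but is not confirmed as the site's actual formula; the names `category_scores` and `overall_score` are hypothetical.

```python
# Category scores as listed on this page; each is out of 25.
category_scores = {
    "Maintenance": 13,
    "Adoption": 10,
    "Maturity": 15,
    "Community": 19,
}


def overall_score(scores: dict[str, int], per_category_max: int = 25) -> tuple[int, int]:
    """Return (total, maximum), ASSUMING the overall score is a plain sum."""
    total = sum(scores.values())
    maximum = per_category_max * len(scores)
    return total, maximum


total, maximum = overall_score(category_scores)
print(f"{total} / {maximum}")  # prints "57 / 100"
```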


Stars: 1,152
Forks: 118
Language: Python
License: Apache-2.0
Last pushed: Mar 12, 2026
Commits (30d): 2

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/transformers/GradientHQ/parallax"
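
The same request can be made from Python using only the standard library. This is a minimal sketch under stated assumptions: the path layout `<category>/<owner>/<repo>` is inferred from the single URL above, the response is assumed to be JSON, and its fields are not documented here, so the code returns the parsed body without interpreting it. The helper names `quality_url` and `fetch_quality` are hypothetical.

```python
import json
import urllib.request
from urllib.parse import quote

# Base path taken from the curl example above.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def quality_url(category: str, owner: str, repo: str) -> str:
    """Build the endpoint URL; the three-segment layout is an assumption."""
    parts = (quote(part, safe="") for part in (category, owner, repo))
    return f"{BASE_URL}/" + "/".join(parts)


def fetch_quality(category: str, owner: str, repo: str, timeout: float = 10.0):
    """Fetch and parse the response, ASSUMING the endpoint returns JSON."""
    url = quality_url(category, owner, repo)
    with urllib.request.urlopen(url, timeout=timeout) as response:
        return json.load(response)
```

Calling `fetch_quality("transformers", "GradientHQ", "parallax")` issues the same GET request as the curl command. Keeping URL construction in its own function makes it possible to check the request target without touching the network or spending any of the daily request quota.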

Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000/day.