N1k1tung/infer-ring
Infer Ring is an iOS and macOS app that facilitates cross-device LLM inference using MLX.
Infer Ring helps you run large language models (LLMs) directly on your Apple devices, even when no single device has enough memory. It splits the model you want to use across multiple iPhones, iPads, and Macs, letting you interact with larger models entirely locally; a conceptual sketch of this layer-splitting idea follows the notes below. It is aimed at researchers, developers, and hobbyists who want to experiment with large AI models without powerful cloud servers.
Use this if you want to run powerful large language models locally on your Apple devices by combining their memory, rather than relying on expensive cloud services.
Not ideal if you need very fast token generation for real-time applications: passing activations between devices over the network adds latency, so performance may lag a single, sufficiently powerful machine.
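To make the sharding idea concrete, here is a minimal Swift sketch of one plausible scheme: give each device a contiguous block of transformer layers in proportion to the memory it contributes. This is an illustration under stated assumptions only; the DeviceBudget type, the partitionLayers function, and the proportional split are hypothetical, not Infer Ring's actual implementation (the app does its inference with MLX).

import Foundation

// Hypothetical sketch: split a model's transformer layers across a
// "ring" of devices in proportion to the memory each one contributes.
// Not Infer Ring's actual code.
struct DeviceBudget {
    let name: String
    let memoryGB: Double   // memory this device can dedicate to weights
}

// Assigns each device a contiguous range of layer indices; the last
// device absorbs any rounding remainder so every layer is covered.
func partitionLayers(totalLayers: Int, devices: [DeviceBudget]) -> [(device: String, layers: Range<Int>)] {
    let totalMemory = devices.reduce(0.0) { $0 + $1.memoryGB }
    var assignments: [(device: String, layers: Range<Int>)] = []
    var nextLayer = 0
    for (index, device) in devices.enumerated() {
        let share = device.memoryGB / totalMemory * Double(totalLayers)
        let count = index == devices.count - 1
            ? totalLayers - nextLayer
            : min(Int(share.rounded()), totalLayers - nextLayer)
        assignments.append((device: device.name, layers: nextLayer ..< nextLayer + count))
        nextLayer += count
    }
    return assignments
}

// Example: a 32-layer model spread over three devices with made-up budgets.
let ring = [
    DeviceBudget(name: "iPhone", memoryGB: 4),
    DeviceBudget(name: "iPad", memoryGB: 8),
    DeviceBudget(name: "Mac", memoryGB: 20),
]
for shard in partitionLayers(totalLayers: 32, devices: ring) {
    print("\(shard.device): layers \(shard.layers)")   // e.g. "Mac: layers 12..<32"
}

A contiguous split means only one activation hand-off per device boundary, which keeps cross-device traffic low on a local network.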
Stars: 9
Forks: 1
Language: Swift
License: MIT
Category: llm-tools
Last pushed: Feb 21, 2026
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/llm-tools/N1k1tung/infer-ring"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
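If you would rather call the endpoint from Swift than from the shell, a minimal sketch using URLSession follows. It assumes nothing beyond what the curl example shows: an unauthenticated GET that returns a JSON body, which the sketch prints as-is, since the response schema isn't documented here.

import Foundation

// Minimal sketch: fetch the quality data for this repo and print the
// raw response body. Assumes the free, keyless tier described above.
let url = URL(string: "https://pt-edge.onrender.com/api/v1/quality/llm-tools/N1k1tung/infer-ring")!

let done = DispatchSemaphore(value: 0)
URLSession.shared.dataTask(with: url) { data, _, error in
    defer { done.signal() }
    if let data = data, let body = String(data: data, encoding: .utf8) {
        print(body)             // raw JSON; schema not documented here
    } else if let error = error {
        print("Request failed: \(error)")
    }
}.resume()
done.wait()                     // block the script until the request completes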
Higher-rated alternatives
jundot/omlx
LLM inference server with continuous batching & SSD caching for Apple Silicon — managed from the...
josStorer/RWKV-Runner
A RWKV management and startup tool, fully automated, only 8MB, providing an interface...
jordanhubbard/nanolang
A tiny experimental language designed to be targeted by coding LLMs
waybarrios/vllm-mlx
OpenAI and Anthropic compatible server for Apple Silicon. Run LLMs and vision-language models...
akivasolutions/tightwad
Pool your CUDA + ROCm GPUs into one OpenAI-compatible API. Speculative decoding proxy gives you...