justrach/bhumi

⚡ Bhumi – The fastest AI inference client for Python, built with Rust for unmatched speed, efficiency, and scalability 🚀

54
/ 100
Established

This tool helps developers who are building applications that use large language models (LLMs) and need to ensure their applications are extremely fast and efficient. It allows them to send requests to over nine different AI providers, including OpenAI, Anthropic, and Google Gemini, and receive responses. Developers use this to integrate AI capabilities into their products with high performance, handling tasks like text generation and image analysis.

Available on PyPI.

Use this if you are a developer building production-ready AI applications and need the fastest possible interaction with various LLM and vision APIs.

Not ideal if you are an end-user without programming experience or if your application does not require high-throughput, low-latency AI inference.

AI-application-development LLM-integration high-performance-computing API-client-development production-AI-systems
Maintenance 10 / 25
Adoption 8 / 25
Maturity 25 / 25
Community 11 / 25

How are scores calculated?

Stars

64

Forks

6

Language

Python

License

Apache-2.0

Last pushed

Jan 22, 2026

Commits (30d)

0

Dependencies

1

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/llm-tools/justrach/bhumi"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.