LLM Orchestration Platforms Embedding Tools
Tools for unifying access to multiple LLM providers, managing model selection, routing requests, and coordinating AI agent execution across different backends. Does NOT include domain-specific applications, single-provider SDKs, or deployment infrastructure (unless orchestration is the primary purpose).
There are 74 llm orchestration platforms tools tracked. 4 score above 50 (established tier). The highest-rated is lfnovo/esperanto at 63/100 with 157 stars.
Get all 74 projects as JSON
curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=embeddings&subcategory=llm-orchestration-platforms&limit=20"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
| # | Tool | Score | Tier |
|---|---|---|---|
| 1 |
lfnovo/esperanto
A unified interface for various AI model providers |
|
Established |
| 2 |
apocas/restai
RESTai is an AIaaS (AI as a Service) open-source platform. Built on top of... |
|
Established |
| 3 |
baidubce/bce-qianfan-sdk
Provide best practices for LMOps, as well as elegant and convenient access... |
|
Established |
| 4 |
sbhjt-gr/InferrLM
On-device AI for iOS & Android |
|
Established |
| 5 |
ProviderProtocol/ai
0-DEP AI DX SDK |
|
Emerging |
| 6 |
Muvon/octolib
The lib to power AI tools. |
|
Emerging |
| 7 |
spring-petclinic/spring-petclinic-langchain4j
Spring Petclinic application with a chatbot powered by OpenAI's Generative... |
|
Emerging |
| 8 |
solygambas/python-openai-projects
13 projects using ChatGPT API, Whisper, Embeddings, and DALL-E with Python. |
|
Emerging |
| 9 |
itayzit/openai-async
A light-weight, asynchronous client for OpenAI API - text completion, image... |
|
Emerging |
| 10 |
IBM/watsonx-ai-java-sdk
The watsonx.ai Java SDK is an open-source library that simplifies the... |
|
Emerging |
| 11 |
maragudk/gai
Go Artificial Intelligence (GAI) helps you work with foundational models,... |
|
Emerging |
| 12 |
rrg92/powershai
Powershell + AI |
|
Emerging |
| 13 |
jchristn/SharpAI
SharpAI is an embeddable embeddings, completions, and model management... |
|
Emerging |
| 14 |
SANSA-Stack/Archived-SANSA-ML
SANSA Machine Learning Layer |
|
Emerging |
| 15 |
UBOS-tech/node-red-contrib-openai-ubos
A Node-RED node that interacts with OpenAI machine learning models to... |
|
Emerging |
| 16 |
lpalbou/AbstractCore
A unified Python library for interaction with multiple Large Language Model... |
|
Emerging |
| 17 |
ramanujammv1988/edge-veda
On-device AI SDK for Flutter — LLM inference, vision, STT, TTS, image... |
|
Emerging |
| 18 |
netresearch/t3x-nr-llm
The shared AI foundation for TYPO3 — one LLM setup for every extension on your site |
|
Emerging |
| 19 |
eliben/gemini-cli
Access Gemini LLMs from the command-line |
|
Emerging |
| 20 |
connerohnesorge/semanticrouter-go
Fast & less costly AI decision making and intelligent processing of multi-modal data. |
|
Emerging |
| 21 |
Prismadic/magnet
the small distributed language model toolkit; fine-tune state-of-the-art... |
|
Emerging |
| 22 |
kingabzpro/easy-local-ai
Run Ollama (Llama3.2), Langflow, Postgres, and qdrant server using Docker Compose. |
|
Emerging |
| 23 |
proj-airi/webai-examples
🧠 Web AI / LLM in browser / Whisper in browser / WebGPU inference Examples |
|
Emerging |
| 24 |
wizenheimer/tinylm
Browser based ML Inference | OpenAI compliant | Run models like DeepSeek,... |
|
Emerging |
| 25 |
priyanshua44/no-llm
no-llm is a lightweight library designed to simplify the integration of... |
|
Emerging |
| 26 |
pinecone-io/gemini-cli-extension
The official Pinecone Gemini CLI extension repo. |
|
Emerging |
| 27 |
skorotkiewicz/llmnet
The Offline Internet. |
|
Experimental |
| 28 |
cheahjs/gemini-to-openai-proxy
Call Gemini (https://ai.google.dev) embedding models with OpenAI-compatible endpoints |
|
Experimental |
| 29 |
nshkrdotcom/ollixir
Ollixir provides a first-class Elixir client with feature parity to the... |
|
Experimental |
| 30 |
symfony/ai-open-ai-platform
OpenAI platform bridge for Symfony AI |
|
Experimental |
| 31 |
bennyschmidt/next-token-prediction
Next-token prediction in JavaScript — build fast language and diffusion models. |
|
Experimental |
| 32 |
burgerkhan6227/tokenWise-Optimizer
🎯 Optimize LLM token usage by 70-90% with smart context ranking, reducing... |
|
Experimental |
| 33 |
Netsleek/the-selection-layer
Defines the Selection Layer — the decision system through which AI models... |
|
Experimental |
| 34 |
codenameakshay/ai_sdk_dart
Dart/Flutter port of Vercel AI SDK v6 — provider-agnostic text generation,... |
|
Experimental |
| 35 |
Naseem77/tokenWise-Optimizer
Smart Context Optimization for LLMs - Reduce tokens by 66%, save 40% on API... |
|
Experimental |
| 36 |
PabloSanchi/IBM-WatsonxAI-Spring-AI-Example
Example of IBM watsonx.ai with Spring AI |
|
Experimental |
| 37 |
liliang-cn/agent-go
AI Agent SDK designed for Go developers |
|
Experimental |
| 38 |
leafspec/spec
Framework-agnostic specification for AI-native applications. LEAF (Listen,... |
|
Experimental |
| 39 |
Aman00000007/lexilux
🚀 Simplify API calls with Lexilux, a unified LLM client that lets you access... |
|
Experimental |
| 40 |
bernard777/tool-selector-cascade
Cascading tool selector for LLM agents — narrows 1000+ tools to the best... |
|
Experimental |
| 41 |
bgokden/llama-embedding
High-performance, thread-safe embedding library using llama.cpp. |
|
Experimental |
| 42 |
kamil5b/go-nl2query-lib
A Go library for converting natural language queries into database queries... |
|
Experimental |
| 43 |
m1thrandir225/galore-services
The main services for the Galore mobile app. |
|
Experimental |
| 44 |
bgokden/llama-reranker
High-performance, thread-safe reranking library using llama.cpp |
|
Experimental |
| 45 |
Dragon-Born/go-llm
Developer-friendly Go SDK for LLM apps: OpenAI, Anthropic, Gemini, Ollama,... |
|
Experimental |
| 46 |
languageseed/valet-gateway
AI Inference Gateway - orchestrates Ollama, vLLM, cloud providers, and... |
|
Experimental |
| 47 |
sahil-makandar/beyond-chatgpt-ai-systems
Guest lecture presentation explaining modern AI architecture: LLMs, RAG... |
|
Experimental |
| 48 |
douglasmitsue/ai-systems-engineering-portfolio
Designing and building scalable AI systems - from deep neural networks to... |
|
Experimental |
| 49 |
superdupers1/ai-thought-visual
🎨 Transform language, voice, and images into structured AI concepts,... |
|
Experimental |
| 50 |
Ammar-Alnagar/Axion
Axion is a high-performance LLM serving platform built with Rust that... |
|
Experimental |
| 51 |
LLM-Gateway-ORG/llm-gateway-core
A Python package to access different LLMs, embeddings, vector stores etc. |
|
Experimental |
| 52 |
FANMixco/openai-outsystems-wrapper
A simple wrapper for the OpenAI APIs for OutSystems |
|
Experimental |
| 53 |
linzeyang/minimax-python-client
An (unofficial) python native client for easy interaction with MiniMax Open Platform |
|
Experimental |
| 54 |
inkybubble/agents-01-foundation
An LLM enhanced with tools and retrieval (no memory for now) |
|
Experimental |
| 55 |
stiebo/spring-ai-samples
Spring AI, chat client, vector store, RAG, multimodality samples |
|
Experimental |
| 56 |
iChetanRaval/Steganography-Project
A professional-grade steganography platform that enables secure data hiding... |
|
Experimental |
| 57 |
jtgsystems/Ollama-Menu-main
🦙 Ollama menu system - Streamlined AI model management |
|
Experimental |
| 58 |
sanketrana2598/ai-terminology
📘 Discover key AI terms and their alternatives in this concise guide to... |
|
Experimental |
| 59 |
orchestra-mcp/pack-ai
AI and LLM integration skills with RAG, embeddings, and vector search |
|
Experimental |
| 60 |
xp-forge/openai
OpenAI APIs for XP Framework |
|
Experimental |
| 61 |
dogunkim/llmnet
🔍 Transform your local LLMs into a private, high-speed search engine,... |
|
Experimental |
| 62 |
vs4vijay/LLM-Ecosystem
Code for Embeddings, VectorStore, SemanticSearch, and RAG using Azure OpenAI |
|
Experimental |
| 63 |
dotcommander/syn
Fast CLI for the Synthetic.new AI API — chat, search, vision, embeddings.... |
|
Experimental |
| 64 |
redoh/ollama-recipes
🦙 Local LLM recipes with Ollama — fine-tuning, RAG, embeddings, multi-model pipelines |
|
Experimental |
| 65 |
zzarif/AI-Detector
Detect AI generated coding answers |
|
Experimental |
| 66 |
ArchitJ6/What-Beats-AI
What Beats AI is an interactive word-challenge game that leverages... |
|
Experimental |
| 67 |
pipewrk/llm-core
Lightweight, composable TypeScript library for semantic chunking, workflow... |
|
Experimental |
| 68 |
onfiiva/ai-assistant-api
LLM API with RAG, Agent, Embeddings and open source models |
|
Experimental |
| 69 |
johnamit/semantic-context-tokens
A hybrid tokenization framework that combines coarse semantic context tokens... |
|
Experimental |
| 70 |
Al-Aswadd/Augment-BYOK-Proxy
🚀 Transform your Augment experience with the BYOK proxy for seamless LLM... |
|
Experimental |
| 71 |
stephengroe/little-language-model
🤖 A tiny GPT-style language model built from scratch. Built to explore ML... |
|
Experimental |
| 72 |
miladhub/chat-ai
Example of using a conversational AI with embeddings with Java |
|
Experimental |
| 73 |
sonufrienko/ai-engineering
OpenAI, LLM, LangChain, LlamaIndex, Vector Search |
|
Experimental |
| 74 |
SkywardAI/lupinIII
Restful API aggregators in Rust which focus on high performance, Rust AI... |
|
Experimental |