kozistr/triton-grpc-proxy-rs
Proxy server, written in Rust, for a Triton gRPC server that runs inference on an embedding model
This project helps developers serve machine learning models that generate embeddings from text. It takes raw text inputs and outputs numerical vector representations (embeddings) that can be used for tasks like search, recommendation, or classification. It's designed for developers building applications that need fast and efficient access to text embedding models.
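The vectors a service like this returns are typically consumed by comparing them to one another, for example with cosine similarity in a search or recommendation pipeline. The sketch below is a hypothetical illustration of that downstream step, not this project's API; the toy 3-dimensional vectors stand in for the much larger vectors (e.g. 1024 dimensions for BAAI/bge-m3) a real model would produce.

```rust
// Cosine similarity between two embedding vectors: the standard way to
// rank documents against a query once both have been embedded.
fn cosine_similarity(a: &[f32], b: &[f32]) -> f32 {
    let dot: f32 = a.iter().zip(b).map(|(x, y)| x * y).sum();
    let norm_a: f32 = a.iter().map(|x| x * x).sum::<f32>().sqrt();
    let norm_b: f32 = b.iter().map(|x| x * x).sum::<f32>().sqrt();
    dot / (norm_a * norm_b)
}

fn main() {
    // Toy 3-dimensional "embeddings"; a real embedding model would
    // return these vectors for the query and each document.
    let query = [0.1_f32, 0.9, 0.2];
    let doc_a = [0.1_f32, 0.8, 0.3]; // semantically close to the query
    let doc_b = [0.9_f32, 0.1, 0.0]; // semantically distant

    let sim_a = cosine_similarity(&query, &doc_a);
    let sim_b = cosine_similarity(&query, &doc_b);
    assert!(sim_a > sim_b);
    println!("doc_a ({sim_a:.3}) ranks above doc_b ({sim_b:.3})");
}
```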
No commits in the last 6 months.
Use this if you are a developer looking for a fast, simple, and dependency-free way to expose a text embedding model (like BAAI/bge-m3) via a gRPC API, abstracting away the complexities of the Triton Inference Server.
Not ideal if you need to serve non-embedding models, require custom pre-processing logic beyond simple text conversion, or are not comfortable with Rust or Docker deployments.
Stars: 21
Forks: 3
Language: Rust
License: Apache-2.0
Category: 
Last pushed: Aug 10, 2024
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/embeddings/kozistr/triton-grpc-proxy-rs"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
Anush008/fastembed-rs: Rust library for vector embeddings and reranking.
huggingface/text-embeddings-inference: A blazing-fast inference solution for text embedding models.
MinishLab/model2vec-rs: Official Rust implementation of Model2Vec.
finalfusion/finalfusion-rust: finalfusion embeddings in Rust.
finalfusion/finalfusion-python: finalfusion embeddings in Python.