huggingface/text-embeddings-inference

A blazing fast inference solution for text embeddings models

/ 100

Established

This solution helps machine learning engineers and data scientists deploy text embedding and sequence classification models for their applications. You input raw text, and it quickly outputs numerical representations (embeddings) or classification labels for use in search, recommendation, or sentiment analysis systems. It's designed for those who need to serve large volumes of text processing requests efficiently.

4,582 stars. Actively maintained with 10 commits in the last 30 days.

Use this if you need to serve text embedding or classification models at high speed and scale, minimizing latency and resource usage.

Not ideal if you are looking for a pre-built application that directly performs tasks like document search or sentiment analysis without needing to deploy models yourself.

natural-language-processing machine-learning-operations information-retrieval text-analytics model-deployment

No Package No Dependents

Maintenance 17 / 25

Adoption 10 / 25

Maturity 16 / 25

Community 19 / 25

How are scores calculated?

Stars

4,582

Forks

370

Language

Rust

License

Apache-2.0

Featured in

Embeddings Are Easier Than Whatever You're Doing Instead

Related tools

Anush008/fastembed-rs

Rust library for vector embeddings and reranking.

MinishLab/model2vec-rs

Official Rust Implementation of Model2Vec

finalfusion/finalfusion-rust

finalfusion embeddings in Rust

finalfusion/finalfusion-python

Finalfusion embeddings in Python

olafurjohannsson/kjarni

Native ML inference engine — embeddings, classification, reranking, search, and text generation....

Explore Embedding Tools

All categories Trending Embeddings directory Insights