huggingface/text-embeddings-inference

A blazing fast inference solution for text embeddings models

62 / 100, Established

This tool helps machine learning engineers and data scientists deploy text embedding and sequence classification models. It takes raw text as input and returns embeddings (numerical vector representations) or classification labels for use in search, recommendation, or sentiment analysis systems. It is designed for teams that need to serve large volumes of text processing requests efficiently.
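As a rough illustration of the text-in, embeddings-out flow described above, the sketch below builds a request for an already running server and compares two returned vectors. The server address, the `/embed` route, and the `{"inputs": ...}` payload shape are assumptions not stated on this page, and the helper names are hypothetical; check the project's own documentation before relying on them.

```python
import json
import math
import urllib.request

# Assumption: a text-embeddings-inference server is reachable at this address.
BASE_URL = "http://localhost:8080"  # hypothetical address


def build_embed_request(texts, base_url=BASE_URL):
    """Build a POST request carrying the texts (route and payload are assumed)."""
    body = json.dumps({"inputs": texts}).encode("utf-8")
    return urllib.request.Request(
        f"{base_url}/embed",
        data=body,
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def embed(texts, base_url=BASE_URL):
    """Send the texts; assumes the reply is a JSON list with one vector per text."""
    request = build_embed_request(texts, base_url)
    with urllib.request.urlopen(request, timeout=30) as response:
        return json.load(response)


def cosine_similarity(a, b):
    """Cosine similarity of two equal-length vectors; 0.0 if either is all zeros."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


# Example use against a live server (not executed here):
#   query_vec, doc_vec = embed(["What is deep learning?", "Deep learning is ..."])
#   print(cosine_similarity(query_vec, doc_vec))
```

In a search system, the similarity score would typically be computed between one query vector and many stored document vectors, with the highest-scoring documents returned first.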

4,582 stars. Actively maintained with 10 commits in the last 30 days.

Use this if you need to serve text embedding or classification models at high speed and scale, minimizing latency and resource usage.

Not ideal if you are looking for a pre-built application that directly performs tasks like document search or sentiment analysis without needing to deploy models yourself.

natural-language-processing machine-learning-operations information-retrieval text-analytics model-deployment
No Package, No Dependents
Maintenance 17 / 25
Adoption 10 / 25
Maturity 16 / 25
Community 19 / 25

Stars: 4,582
Forks: 370
Language: Rust
License: Apache-2.0
Last pushed: Mar 12, 2026
Commits (30d): 10

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/embeddings/huggingface/text-embeddings-inference"

Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000/day.
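The same request can be made from a script. The sketch below is a minimal Python equivalent of the curl command, assuming the endpoint returns JSON and that the path follows a category/owner/repo pattern; both are inferred from the single URL above, and the function names are hypothetical.

```python
import json
import urllib.request

API_ROOT = "https://pt-edge.onrender.com/api/v1/quality"


def quality_url(category, owner, repo):
    """Build the endpoint URL (path pattern inferred from the one example URL)."""
    return f"{API_ROOT}/{category}/{owner}/{repo}"


def fetch_quality(category, owner, repo):
    """Fetch the quality data; assumes the response body is JSON."""
    with urllib.request.urlopen(quality_url(category, owner, repo), timeout=30) as response:
        return json.load(response)


# Example use (not executed here, counts against the daily request limit):
#   data = fetch_quality("embeddings", "huggingface", "text-embeddings-inference")
#   print(data)
```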