huggingface/text-generation-inference

Large Language Model Text Generation Inference

68 / 100 (Established)

This project helps deploy and serve large language models (LLMs) for generating text efficiently. It takes a chosen LLM and a text prompt as input, then generates a natural language response or completion. This is ideal for machine learning engineers or developers looking to integrate powerful text generation into their applications or services.
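As a rough sketch of that prompt-in, text-out flow, the Python below posts a prompt to an already running server's `/generate` route and reads back the generated text. The address `http://localhost:8080`, the helper names `build_generate_request` and `generate`, and the `max_new_tokens` default are assumptions chosen for illustration and do not come from this page; the project's own documentation is the authority on the request format.

```python
import json
import urllib.request

# Assumption: a text-generation-inference server is already running here.
TGI_URL = "http://localhost:8080"


def build_generate_request(prompt, max_new_tokens=64, base_url=TGI_URL):
    """Build (but do not send) a POST request for the /generate route."""
    payload = {
        "inputs": prompt,
        "parameters": {"max_new_tokens": max_new_tokens},
    }
    return urllib.request.Request(
        base_url + "/generate",
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def generate(prompt, max_new_tokens=64, base_url=TGI_URL, timeout=30):
    """Send the prompt to the server and return the generated text."""
    request = build_generate_request(prompt, max_new_tokens, base_url)
    with urllib.request.urlopen(request, timeout=timeout) as response:
        body = json.loads(response.read().decode("utf-8"))
    return body["generated_text"]
```

With a server reachable at the assumed address, `generate("What is model serving?")` would return the completion as a string. Building the request separately from sending it keeps the payload easy to inspect without a live server.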

10,802 stars. Used by 3 other packages. Actively maintained with 1 commit in the last 30 days. Available on PyPI.

Use this if you are a machine learning engineer or developer needing to deploy and serve a large language model for text generation with high performance and specific features like streaming or structured output.

Not ideal if you are looking for a pre-built application that directly solves an end-user problem, as this is an infrastructure tool for developers.

LLM deployment, MLOps, text generation, AI infrastructure, model serving
Maintenance 9 / 25
Adoption 13 / 25
Maturity 25 / 25
Community 21 / 25


Stars: 10,802
Forks: 1,261
Language: Python
License: Apache-2.0
Last pushed: Jan 08, 2026
Commits (30d): 1
Dependencies: 3
Reverse dependents: 3

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/transformers/huggingface/text-generation-inference"

Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000 requests/day.
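The same request can be made from Python. The sketch below rebuilds the URL shown in the curl command and parses the reply, assuming the endpoint returns JSON (this page does not state the response format or its fields). The helper names `quality_url` and `fetch_quality` are hypothetical, and treating the `transformers` path segment as a category is a guess based on the URL alone.

```python
import json
import urllib.request
from urllib.parse import quote

# Base path taken from the curl example on this page.
API_BASE = "https://pt-edge.onrender.com/api/v1/quality"


def quality_url(category, owner, repo):
    """Build the quality-data URL for one project (path layout inferred from the curl example)."""
    parts = [quote(part, safe="") for part in (category, owner, repo)]
    return API_BASE + "/" + "/".join(parts)


def fetch_quality(category, owner, repo, timeout=30):
    """Fetch the quality record; assumes the body is JSON, which is unverified."""
    url = quality_url(category, owner, repo)
    with urllib.request.urlopen(url, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))
```

Calling `fetch_quality("transformers", "huggingface", "text-generation-inference")` issues the same GET request as the curl command. With the keyless limit of 100 requests/day, it is worth caching the result rather than fetching it on every run.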