Orion-zhen/transAPI
OpenAI compatible API purely based on Transformers
This project helps developers test large language models (LLMs) that aren't supported by common inference engines like vLLM or HuggingFace TGI. It takes any model compatible with HuggingFace Transformers and makes it accessible through an OpenAI-compatible API. This allows developers to quickly integrate and experiment with a wider range of models for their applications.
No commits in the last 6 months.
Use this if you are a developer experimenting with less common or newly released large language models that are supported by HuggingFace Transformers but not by other popular inference backends, and you need an OpenAI-compatible interface for testing.
Not ideal if you need a production-ready solution with advanced features like continuous batching, paged attention, or OpenAI-style tool calling, or if the model you are using is already supported by vLLM or TGI.
Stars
9
Forks
1
Language
Python
License
AGPL-3.0
Category
Last pushed
Aug 09, 2025
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/transformers/Orion-zhen/transAPI"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
golsun/DialogRPT
EMNLP 2020: "Dialogue Response Ranking Training with Large-Scale Human Feedback Data"
godatadriven/rhyme-with-ai
Rhyme with AI
ArchAIve-Project/Backend
A complex Flask API system empowered by custom ML models, LLMs and processing to facilitate...
flozi00/atra
An open source NLP as a service project focused on providing state of the art systems with ease....
declare-lab/CICERO
The purpose of this repository is to introduce new dialogue-level commonsense inference datasets...