ML API Deployment Transformer Models

Tools and frameworks for deploying transformer models as production-ready APIs using FastAPI, Flask, or similar web services with containerization and inference optimization. Does NOT include model training, fine-tuning frameworks, or non-API deployment methods like static model serving.

There are 24 ml api deployment models tracked. The highest-rated is golsun/DialogRPT at 42/100 with 345 stars.

Get all 24 projects as JSON

curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=transformers&subcategory=ml-api-deployment&limit=20"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.

# Model Score Tier
1 golsun/DialogRPT

EMNLP 2020: "Dialogue Response Ranking Training with Large-Scale Human Feedback Data"

42
Emerging
2 godatadriven/rhyme-with-ai

Rhyme with AI

41
Emerging
3 ArchAIve-Project/Backend

A complex Flask API system empowered by custom ML models, LLMs and...

39
Emerging
4 flozi00/atra

An open source NLP as a service project focused on providing state of the...

37
Emerging
5 declare-lab/CICERO

The purpose of this repository is to introduce new dialogue-level...

35
Emerging
6 henrikalbihn/gliner-as-a-service

GLiNER model in a FastAPI microservice.

34
Emerging
7 samestrin/llm-services-api

A FastAPI-powered REST API offering a comprehensive suite of natural...

30
Emerging
8 Orion-zhen/transAPI

OpenAI compatible API purely based on Transformers

30
Emerging
9 spyker77/fastapi-tdd-docker

Transformers with test-driven development

29
Experimental
10 IsmaelMousa/TTL

Full-stack simulator for a todo task list application using FastAPI, I built...

27
Experimental
11 imsigma1/AI-Knowledge-Creativity

🧠 Power AI-driven tools for creative exploration and knowledge retrieval,...

25
Experimental
12 anar-rzayev/Empathetic-Dialogue-Generation

Open-Domain Dialogue model which produces empathetic responses when trained...

23
Experimental
13 mxchinegod/digits-api-ml

digits-api-ml is a large suite of API endpoints that directly respond with...

22
Experimental
14 chamajay/deepsense-backend

Backend server of DeepSense. Provides an API to access machine learning models.

21
Experimental
15 DoyoungBok/genai-docker-api

Dockerized FastAPI inference API using Hugging Face Transformers (FLAN-T5)...

21
Experimental
16 henrikalbihn/gliclass-as-a-service

GLiClass model in a FastAPI microservice.

20
Experimental
17 NguyenDucAnh-2k6/OOP_Logistics_project

Disaster Logistics App - A desktop solution for disaster aiding

16
Experimental
18 Linutesto/Fractal-Neurons-LILA_JAILBREAK

Fractal Neurons — fractal MoE + conversational tooling (7950X/4090 tuned)

13
Experimental
19 itsmyfacade/itsmyfacade

Production-grade machine learning systems, model inference pipelines, and...

13
Experimental
20 KG87/ml-api-project

Production-ready ML API with FastAPI, Flan-T5, response caching, and...

12
Experimental
21 SosoTlm/AI-Template

An Simple AI Template with coding abilities to help you code in various languages

11
Experimental
22 Aya-Jafar/AI-services-API

Django API for multiple AI medical services

11
Experimental
23 darshan-vijay/Plant-Doc

Detects plant diseases from leaf images using a fine-tuned ResNet model,...

10
Experimental
24 darkbits018/pup-predict-backend

Backend for AI powered Dog Breed Prediction App

10
Experimental