Ariyan-Pro/RAG-Latency-Optimization

CPU-optimized RAG pipeline reducing latency 2.7× (247ms → 92ms). Implements caching, filtering, quantization for production. Complete with FastAPI, Docker, benchmarks, investor materials. The engineering showcase that sells itself.

/ 100

Experimental

No License No Package No Dependents

Maintenance 10 / 25

Adoption 0 / 25

Maturity 3 / 25

Community 0 / 25

How are scores calculated?

Stars

—

Forks

—

Language

Python

License

—

Category

rag-techniques-frameworks

Last pushed

Jan 24, 2026

Commits (30d)

GitHub

Rag Techniques Frameworks · 34 tools

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/rag/Ariyan-Pro/RAG-Latency-Optimization"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.

Higher-rated alternatives

Marker-Inc-Korea/AutoRAG

AutoRAG: An Open-Source Framework for Retrieval-Augmented Generation (RAG) Evaluation &...

jxzhangjhu/Awesome-LLM-RAG

Awesome-LLM-RAG: a curated list of advanced retrieval augmented generation (RAG) in Large Language Models

IntelLabs/RAG-FiT

Framework for enhancing LLMs for RAG tasks using fine-tuning.

coree/awesome-rag

A curated list of retrieval-augmented generation (RAG) in large language models

IntelLabs/fastRAG

Efficient Retrieval Augmentation and Generation Framework

Explore RAG Tools

All categories Trending RAG directory Insights