flashserve/RAGPulse

An Open-Source RAG Workload Trace to Optimize RAG Serving Systems

32
/ 100
Emerging

This project provides a realistic dataset of user interactions with a Q&A system that uses Retrieval Augmented Generation (RAG). It includes anonymized records of user questions, system prompts, retrieved information, and chat history. AI infrastructure engineers can use this data to test and improve the performance of their RAG-based systems.

Use this if you need to optimize the speed and efficiency of your RAG serving infrastructure using real-world usage patterns.

Not ideal if you are looking for raw conversational text or a dataset for training RAG models, as all content is replaced with anonymized hash IDs.

AI infrastructure RAG systems System optimization Performance testing Workload analysis
No Package No Dependents
Maintenance 6 / 25
Adoption 7 / 25
Maturity 13 / 25
Community 6 / 25

How are scores calculated?

Stars

35

Forks

2

Language

Python

License

MIT

Last pushed

Nov 18, 2025

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/rag/flashserve/RAGPulse"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.