Azure-Samples/rag-as-a-service-with-vision

This repository offers a Python framework for a retrieval-augmented generation (RAG) pipeline using text and images from MHTML documents, leveraging Azure AI and OpenAI services. It includes ingestion and enrichment flows, a RAG with Vision pipeline, and evaluation tools.

45
/ 100
Emerging

This framework helps knowledge workers like researchers or analysts quickly get answers from complex documents that mix text and images, such as web archives or reports. You feed it MHTML files, and it uses AI to understand both the words and pictures, then provides precise answers to your questions. It's designed for anyone needing to extract insights from rich, multi-modal content.

Use this if you need to build a system that can accurately answer questions by searching and understanding both text and images within archived web pages or other MHTML documents.

Not ideal if your documents are purely text-based or if you don't need to incorporate visual information from images into your search and answer generation.

knowledge-management document-intelligence enterprise-search content-analysis information-retrieval
No Package No Dependents
Maintenance 6 / 25
Adoption 7 / 25
Maturity 16 / 25
Community 16 / 25

How are scores calculated?

Stars

34

Forks

7

Language

Python

License

MIT

Last pushed

Nov 17, 2025

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/rag/Azure-Samples/rag-as-a-service-with-vision"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.