Azure-Samples/multimodal_rag_python

Python notebook for solving overlapping tables problem with Azure document intelligence , semantic chunking, RAG , Azure AI Search

27
/ 100
Experimental

This project helps operations managers, data analysts, or researchers extract and understand complex information from documents containing large tables that span multiple pages and lack clear headers. It takes your PDF documents as input and outputs structured information, making it easier to analyze and retrieve specific data points that might otherwise be missed. This is particularly useful for anyone dealing with detailed reports, financial statements, or technical specifications where data is often presented in challenging, multi-page table formats.

No commits in the last 6 months.

Use this if you frequently need to accurately extract and query information from PDF documents containing hierarchical tables that are spread across multiple pages without consistent headers.

Not ideal if your documents primarily contain simple text or well-structured, single-page tables with clear headers.

document-analysis data-extraction information-retrieval report-processing unstructured-data
No License Stale 6m No Package No Dependents
Maintenance 0 / 25
Adoption 5 / 25
Maturity 8 / 25
Community 14 / 25

How are scores calculated?

Stars

9

Forks

3

Language

Jupyter Notebook

License

Last pushed

Sep 04, 2024

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/rag/Azure-Samples/multimodal_rag_python"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.