tahangz/Multimodal_OCR_LLM

This project is a user-friendly web application that allows you to upload PDFs, DOCX files, or images, automatically extracts text using advanced OCR techniques, and generates concise summaries using Google Gemini 2.5 Flash via LangChain. Built with Streamlit, it provides a seamless experience for document understanding and quick insight extraction

13
/ 100
Experimental

This web application helps students, researchers, and professionals quickly understand information from various documents. You can upload PDFs, DOCX files, or images, and it will automatically extract the text. Then, it uses AI to generate a concise summary of the content, saving you time and effort.

No commits in the last 6 months.

Use this if you need to quickly get the main points from scanned documents, reports, or articles without reading through everything.

Not ideal if you need to process extremely long documents with very fine-grained summaries, or if you require offline processing without an internet connection for the AI summarization.

document-analysis research-summary information-extraction report-digestion content-briefing
No License Stale 6m No Package No Dependents
Maintenance 2 / 25
Adoption 4 / 25
Maturity 7 / 25
Community 0 / 25

How are scores calculated?

Stars

7

Forks

Language

Python

License

Last pushed

Aug 06, 2025

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/llm-tools/tahangz/Multimodal_OCR_LLM"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.