arshad-yaseen/ocr-llm
⚡️ Fast, ultra-accurate text extraction from any image or PDF—including challenging ones—with structured markdown output powered by vision models.
This tool helps you quickly and accurately convert text from any image or PDF document into clean, structured Markdown. It can handle even challenging inputs like low-quality scans, handwriting, tables, equations, and complex layouts. Anyone who needs to extract and reuse content from visual documents, such as researchers, data entry specialists, or content managers, will find this useful.
No commits in the last 6 months. Available on npm.
Use this if you need to reliably extract text from a wide variety of image and PDF formats, including those with intricate designs or poor quality, and want the output in a clean Markdown format.
Not ideal if you need to process PDFs entirely within a browser or serverless environment, because PDF handling depends on backend system dependencies. In that case you would need a hybrid client-server approach.
Stars: 39
Forks: 5
Language: TypeScript
License: MIT
Category:
Last pushed: Jan 09, 2025
Commits (30d): 0
Dependencies: 2
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/llm-tools/arshad-yaseen/ocr-llm"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
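The same request can be sketched in TypeScript, the repository's own language. This is a minimal sketch, not an official client: the helper names `buildQualityUrl` and `fetchQuality` are hypothetical, the response is assumed to be JSON (the listing does not document its schema, so it is typed as `unknown`), and a runtime with a global `fetch` (for example Node 18 or later) is assumed.

```typescript
// Base URL copied from the curl example above; everything else is illustrative.
const API_BASE = "https://pt-edge.onrender.com/api/v1/quality/llm-tools";

// Hypothetical helper: builds the endpoint URL for an owner/repo pair.
function buildQualityUrl(owner: string, repo: string): string {
  return `${API_BASE}/${encodeURIComponent(owner)}/${encodeURIComponent(repo)}`;
}

// Hypothetical helper: requests the listing data without an API key.
// Assumption: the body is JSON. Its shape is undocumented here, hence `unknown`.
async function fetchQuality(owner: string, repo: string): Promise<unknown> {
  const response = await fetch(buildQualityUrl(owner, repo));
  if (!response.ok) {
    // Exceeding the daily request limit would presumably surface here
    // as a non-2xx status (an assumption, not documented behavior).
    throw new Error(`Request failed with status ${response.status}`);
  }
  return response.json();
}
```

Calling `fetchQuality("arshad-yaseen", "ocr-llm")` would issue the same GET request as the curl command. Keeping URL construction in its own function makes it possible to check the request target without touching the network.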
Higher-rated alternatives
NanoNets/docstrange
Extract and convert data from any document, images, pdfs, word doc, ppt or URL into multiple...
th1nhhdk/local_ai_ocr
A local, offline (after initial setup), portable OCR software that can process images and PDF...
Dicklesworthstone/llm_aided_ocr
Enhances Tesseract OCR output using LLMs (local or API) for error correction, smart chunking,...
emcf/thepipe
Get clean data from tricky documents, powered by vision-language models ⚡
langstruct-ai/langstruct
Extract structured data from any content using LLMs.