All Document AI Tools
154 tools ranked by quality score
| # | Tool | Score | Tier |
|---|---|---|---|
| 1 |
opendatalab/MinerU
Transforms complex documents like PDFs into LLM-ready markdown/JSON for your... |
|
Verified |
| 2 |
mehmet-kozan/pdf-parse
Pure TypeScript, cross-platform module for extracting text, images, and... |
|
Verified |
| 3 |
HIllya51/LunaTranslator
视觉小说翻译器 / Visual Novel Translator |
|
Verified |
| 4 |
ShareX/ShareX
ShareX is a free and open-source application that enables users to capture... |
|
Verified |
| 5 |
btwld/docling-sdk
A TypeScript SDK for Docling - Bridge between the Python Docling ecosystem... |
|
Established |
| 6 |
STranslate/STranslate
A ready-to-go translation ocr tool developed with WPF/WPF 开发的一款即用即走的翻译、OCR工具 |
|
Established |
| 7 |
tisfeng/Easydict
一个简洁优雅的词典翻译 macOS App。开箱即用,支持离线 OCR 识别,支持有道词典,🍎 苹果系统词典,🍎... |
|
Established |
| 8 |
zclucas/RMT
RMT (RuoMengTu) is a free, open-source macro tool built on AHKv2. Let the... |
|
Established |
| 9 |
readur/readur
Quick, painless, intuitive OCR platform written in Rust and TypeScript.... |
|
Established |
| 10 |
pymupdf/PyMuPDF
PyMuPDF is a high performance Python library for data extraction, analysis,... |
|
Established |
| 11 |
run-llama/llama-cloud-py
Python SDK for OCR and document parsing in the cloud with LlamaParse |
|
Established |
| 12 |
TheJoeFin/Text-Grab
Use OCR in Windows quickly and easily with Text Grab. With optional... |
|
Established |
| 13 |
docling-project/docling
Get your documents ready for gen AI |
|
Established |
| 14 |
ocrmypdf/OCRmyPDF
OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched |
|
Established |
| 15 |
RapidAI/RapidOCR
📄 Awesome OCR multiple programing languages toolkits based on ONNX Runtime,... |
|
Established |
| 16 |
bpwhelan/GameSentenceMiner
An immersion toolkit for learning Languages through games and other visual media. |
|
Established |
| 17 |
datalab-to/chandra
OCR model that handles complex tables, forms, handwriting with full layout. |
|
Established |
| 18 |
xushengfeng/eSearch
截屏 离线OCR 搜索翻译 以图搜图 贴图 录屏 万向滚动截屏 屏幕翻译 Screenshot Offline OCR Search ... |
|
Established |
| 19 |
run-llama/liteparse
A fast, helpful, and open-source document parser |
|
Established |
| 20 |
zai-org/GLM-OCR
GLM-OCR: Accurate × Fast × Comprehensive |
|
Established |
| 21 |
pytr-org/pytr
Use TradeRepublic in terminal and mass download all documents |
|
Established |
| 22 |
CCExtractor/ccextractor
CCExtractor - Official version maintained by the core team |
|
Established |
| 23 |
felipeall/resumeio-to-pdf
Download your resume from resume.io as PDF |
|
Established |
| 24 |
mittagessen/kraken
OCR engine for all the languages |
|
Established |
| 25 |
seanghay/sone
Declarative Canvas layout engine for JavaScript with advanced rich text support. |
|
Established |
| 26 |
ballerine-io/ballerine
Open-source infrastructure and data orchestration platform for risk decisioning |
|
Established |
| 27 |
thanhkeke97/RSTGameTranslation
🎮 Real-time Game Translation Tool | OCR + AI Translation | Windows Gaming |... |
|
Established |
| 28 |
hankei6km/gas-gocr2notion
Google Drive で OCR を行い、結果を Notion データベースへ送信する Google Apps Script ライブラリー。 |
|
Established |
| 29 |
formkiq/formkiq-core
Open-source document management platform leveraging AWS managed services.... |
|
Established |
| 30 |
Achno/gowall
A tool to convert a Wallpaper's color scheme / palette, OCR with VLM's... |
|
Established |
| 31 |
RapidAI/RapidDoc
A high-performance, open-source PDF data extraction tool. ... |
|
Established |
| 32 |
ArtifexSoftware/mupdf.js
JavaScript bindings for MuPDF |
|
Established |
| 33 |
meangrinch/MangaTranslator
Manga translation app powered by AI |
|
Established |
| 34 |
oomol-lab/pdf-craft
PDF craft can convert PDF files into various other formats. This project... |
|
Established |
| 35 |
TareHimself/manga-translator
A manga translator built with python |
|
Established |
| 36 |
shibing624/imgocr
Python3 package for Chinese/English OCR,use paddleocr-v5 onnx model(~20MB),... |
|
Established |
| 37 |
zibo-chen/rust-paddle-ocr
高性能OCR识别库,支持上百种语言,提供命令行、图形界面及C API多种调用方式,使用便捷高效。 High-performance OCR... |
|
Established |
| 38 |
dynobo/normcap
OCR powered screen-capture tool to capture information instead of images |
|
Established |
| 39 |
wikimedia/wikimedia-ocr
API wrapper enabling Wikisources to submit images for optical character recognition. |
|
Established |
| 40 |
ispras/dedoc
Dedoc is a library (service) for automate documents parsing and bringing to... |
|
Established |
| 41 |
uptonking/note4yaoo
daily notes |
|
Established |
| 42 |
scribeocr/scribeocr
Web interface for recognizing text, proofreading OCR, and creating... |
|
Established |
| 43 |
bzsanti/oxidizePdf
a PDF library for rust |
|
Established |
| 44 |
UB-Mannheim/escriptorium
Clone of https://gitlab.com/scripta/escriptorium.git with updates from UB Mannheim |
|
Established |
| 45 |
arvindrajan92/DTrOCR
A PyTorch implementation of DTrOCR: Decoder-only Transformer for Optical... |
|
Established |
| 46 |
ecdye/macSubtitleOCR
Convert bitmap subtitles into SubRip format using the macOS Vision framework |
|
Established |
| 47 |
rtr46/meikipop
meikipop - universal japanese ocr popup dictionary for windows, linux and macos |
|
Established |
| 48 |
mindee/mindee-api-nodejs
Mindee API Helper Library for Node.js |
|
Established |
| 49 |
ocrbase-hq/ocrbase
📄 PDF ->.MD/.JSON API & SDK for PaddleOCR-VL with structured data... |
|
Established |
| 50 |
danpla/dpscreenocr
A program to recognize text on the screen |
|
Established |
| 51 |
marieai/marie-ai
Complex data extraction and orchestration framework designed for processing... |
|
Established |
| 52 |
GeiserX/paperless-telegram-bot
Manage Paperless-NGX documents entirely through Telegram - upload, search,... |
|
Established |
| 53 |
scribeocr/scribe.js
JavaScript OCR and text extraction for images and PDFs. |
|
Established |
| 54 |
Roots-Automation/GutenOCR
Open-source tools for training and evaluating Vision Language Models for OCR |
|
Established |
| 55 |
R0Wi-DEV/workflow_ocr
This is a Nextcloud Workflow App which enables you to process files via OCR... |
|
Established |
| 56 |
Purfview/InpaintDelogo
Advanced delogo plugin for AviSynth+ |
|
Established |
| 57 |
doo/scanbot-sdk-example-flutter
Easy-to-use Flutter document scanner and data extraction plugin |
|
Established |
| 58 |
ZANdewanai/Genshin-Impact-Rich-Presence
Genshin Impact Rich Presence |
|
Established |
| 59 |
NitishKumar-ai/PersonalLearningPro
PersonalLearningPro is an open‑source, AI-powered school learning platform... |
|
Established |
| 60 |
wxyhgk/retain-pdf
在保留版面、公式与结构的前提下进行 PDF 翻译,适用于科研与技术文档 |
|
Established |
| 61 |
wolfmanstout/screen-ocr
Easily perform OCR on portions of the screen, choosing from a selection of backends. |
|
Established |
| 62 |
OuterSpaceHobo/ScanLingua
Free open source chrome extension for immersive japanese, chinese and korean... |
|
Emerging |
| 63 |
dills122/MTG-Card-Analyzer
(🚧) Analyze images of MTG cards (Clean up in progress) |
|
Emerging |
| 64 |
SnapXL/SnapX
SnapX is a free, open-source, cross-platform tool that lets you capture or... |
|
Emerging |
| 65 |
workoss/boot
常用的工具或者服务 |
|
Emerging |
| 66 |
yomihon/yomihon
Free and open source manga reader for Android - now with OCR (text recognition) |
|
Emerging |
| 67 |
monchin/tablers
A blazingly fast PDF table extraction library with python API powered by Rust |
|
Emerging |
| 68 |
CarvingIT/smart-repository
Institutional Repository of Knowledge |
|
Emerging |
| 69 |
yash2974/Zenpark
🔐 Zenpark – Smart Parking Management System Zenpark is an AI-powered smart... |
|
Emerging |
| 70 |
ZingYao/autogo_scriptengine
AutoGo 脚本引擎扩展方案 - 为 AutoGo 提供 JavaScript 和 Lua 双引擎支持,包含 20+... |
|
Emerging |
| 71 |
zai-org/GLM-skills
Official skills for the GLM family of models. |
|
Emerging |
| 72 |
run-llama/llama-cloud-ts
Typescript SDK for OCR and document parsing in the cloud with LlamaParse |
|
Emerging |
| 73 |
thomaswantstobeaskeleton/BallonsTranslator-Pro
BallonsTranslatorPro — Community fork of BallonsTranslator.... |
|
Emerging |
| 74 |
nordie92/AoE4BO
Age of Empires 4 overlay to lern build orders |
|
Emerging |
| 75 |
veryfi/veryfi-nodejs
Node.js module for communicating with the Veryfi OCR API. || read:... |
|
Emerging |
| 76 |
openva/rs-video-processor
The video OCR processor for Richmond Sunlight. |
|
Emerging |
| 77 |
doo/scanbot-sdk-example-ios
Easy-to-use iOS document scanner and data extraction library for native iOS apps |
|
Emerging |
| 78 |
mindee/mindee-api-ruby
Mindee API Helper Library for Ruby |
|
Emerging |
| 79 |
Ronin-CK/QuickSnip
⚡ Lightweight Wayland OCR & Google Lens utility built with Quickshell. |
|
Emerging |
| 80 |
ieasybooks/tahweel.rb
تحويل ملفات PDF إلى TXT و DOCX و JSON |
|
Emerging |
| 81 |
jasperdevs/yoink
Free, open-source screenshot tool. Capture, annotate, and share with a single hotkey. |
|
Emerging |
| 82 |
firecrawl/pdf-inspector
Fast Rust library for PDF inspection, classification, and text extraction.... |
|
Emerging |
| 83 |
Cilda/UmaUmaChecker
ウマ娘で選択肢のステータスを画像認識によって可視化する |
|
Emerging |
| 84 |
NirKli/WattBot
Smart OCR app for reading electricity meters from images |
|
Emerging |
| 85 |
simpledms/simpledms
Document management for small businesses. |
|
Emerging |
| 86 |
doo/scanbot-sdk-maui-example
Easy-to-use .NET MAUI document scanner and data extraction library |
|
Emerging |
| 87 |
regulaforensics/DocumentReader-web-js-client
Regula Document Reader web API js client for the browser and node.js based on axios |
|
Emerging |
| 88 |
prabesh704/ImageOCR
🖼️ Build an offline image search engine that indexes images and text for... |
|
Emerging |
| 89 |
SeseydOw/Captcha-Bypass-Tool
pentesting tool to bypass captcha Steam, Gmail, Instagram, Facebook,... |
|
Emerging |
| 90 |
Sixt/tensorlake-go
Go SDK for the Tensorlake API: document intelligence, cloud sandboxes, PTY... |
|
Emerging |
| 91 |
Agions/HardSubX
A professional video hard subtitle extraction tool with OCR. Extract... |
|
Emerging |
| 92 |
veryfi/veryfi-lens-react-native-demo
Example Demo App about how to use our react native wrapper |
|
Emerging |
| 93 |
Akronae/windows_media_ocr_cli
🔎 OCR CLI that outputs structured data with bounding rects using local... |
|
Emerging |
| 94 |
PELock/Dekoder-AZTEC-2D-JavaScript
Dekoder Kodu AZTEC 2D z Dowodu Rejestracyjnego dla JavaScript i Node.js (Web... |
|
Emerging |
| 95 |
1003129155/jietuba
A screenshot ocr and clipboard manager, available only on Windows 10+. ... |
|
Emerging |
| 96 |
abishekgiri/boring-ai
Self-hosted AI expense manager that turns receipts into structured data and... |
|
Emerging |
| 97 |
clark-labs-inc/pdfsink-rs
Fast pure-Rust PDF extraction library and CLI by Clark Labs Inc. — 10–50x... |
|
Emerging |
| 98 |
narzaut/translator
Translator Overlay |
|
Emerging |
| 99 |
hcmhcs/screenTranslate
Screen translation app for macOS — select any area, get instant translation.... |
|
Emerging |
| 100 |
nonwill/nonwill.github.io
CDN data of www.autoptr.top. |
|
Emerging |