All Document AI Tools

154 tools ranked by quality score

Showing 1–100 of 154
# Tool Score Tier
1 opendatalab/MinerU

Transforms complex documents like PDFs into LLM-ready markdown/JSON for your...

80
Verified
2 mehmet-kozan/pdf-parse

Pure TypeScript, cross-platform module for extracting text, images, and...

76
Verified
3 HIllya51/LunaTranslator

视觉小说翻译器 / Visual Novel Translator

71
Verified
4 ShareX/ShareX

ShareX is a free and open-source application that enables users to capture...

71
Verified
5 btwld/docling-sdk

A TypeScript SDK for Docling - Bridge between the Python Docling ecosystem...

69
Established
6 STranslate/STranslate

A ready-to-go translation ocr tool developed with WPF/WPF 开发的一款即用即走的翻译、OCR工具

69
Established
7 tisfeng/Easydict

一个简洁优雅的词典翻译 macOS App。开箱即用,支持离线 OCR 识别,支持有道词典,🍎 苹果系统词典,🍎...

68
Established
8 zclucas/RMT

RMT (RuoMengTu) is a free, open-source macro tool built on AHKv2. Let the...

68
Established
9 readur/readur

Quick, painless, intuitive OCR platform written in Rust and TypeScript....

68
Established
10 pymupdf/PyMuPDF

PyMuPDF is a high performance Python library for data extraction, analysis,...

68
Established
11 run-llama/llama-cloud-py

Python SDK for OCR and document parsing in the cloud with LlamaParse

67
Established
12 TheJoeFin/Text-Grab

Use OCR in Windows quickly and easily with Text Grab. With optional...

67
Established
13 docling-project/docling

Get your documents ready for gen AI

67
Established
14 ocrmypdf/OCRmyPDF

OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched

67
Established
15 RapidAI/RapidOCR

📄 Awesome OCR multiple programing languages toolkits based on ONNX Runtime,...

66
Established
16 bpwhelan/GameSentenceMiner

An immersion toolkit for learning Languages through games and other visual media.

65
Established
17 datalab-to/chandra

OCR model that handles complex tables, forms, handwriting with full layout.

65
Established
18 xushengfeng/eSearch

截屏 离线OCR 搜索翻译 以图搜图 贴图 录屏 万向滚动截屏 屏幕翻译 Screenshot Offline OCR Search ...

65
Established
19 run-llama/liteparse

A fast, helpful, and open-source document parser

64
Established
20 zai-org/GLM-OCR

GLM-OCR: Accurate × Fast × Comprehensive

64
Established
21 pytr-org/pytr

Use TradeRepublic in terminal and mass download all documents

64
Established
22 CCExtractor/ccextractor

CCExtractor - Official version maintained by the core team

64
Established
23 felipeall/resumeio-to-pdf

Download your resume from resume.io as PDF

63
Established
24 mittagessen/kraken

OCR engine for all the languages

62
Established
25 seanghay/sone

Declarative Canvas layout engine for JavaScript with advanced rich text support.

60
Established
26 ballerine-io/ballerine

Open-source infrastructure and data orchestration platform for risk decisioning

60
Established
27 thanhkeke97/RSTGameTranslation

🎮 Real-time Game Translation Tool | OCR + AI Translation | Windows Gaming |...

59
Established
28 hankei6km/gas-gocr2notion

Google Drive で OCR を行い、結果を Notion データベースへ送信する Google Apps Script ライブラリー。

59
Established
29 formkiq/formkiq-core

Open-source document management platform leveraging AWS managed services....

58
Established
30 Achno/gowall

A tool to convert a Wallpaper's color scheme / palette, OCR with VLM's...

58
Established
31 RapidAI/RapidDoc

A high-performance, open-source PDF data extraction tool. ...

58
Established
32 ArtifexSoftware/mupdf.js

JavaScript bindings for MuPDF

58
Established
33 meangrinch/MangaTranslator

Manga translation app powered by AI

57
Established
34 oomol-lab/pdf-craft

PDF craft can convert PDF files into various other formats. This project...

57
Established
35 TareHimself/manga-translator

A manga translator built with python

56
Established
36 shibing624/imgocr

Python3 package for Chinese/English OCR,use paddleocr-v5 onnx model(~20MB),...

56
Established
37 zibo-chen/rust-paddle-ocr

高性能OCR识别库,支持上百种语言,提供命令行、图形界面及C API多种调用方式,使用便捷高效。 High-performance OCR...

56
Established
38 dynobo/normcap

OCR powered screen-capture tool to capture information instead of images

55
Established
39 wikimedia/wikimedia-ocr

API wrapper enabling Wikisources to submit images for optical character recognition.

55
Established
40 ispras/dedoc

Dedoc is a library (service) for automate documents parsing and bringing to...

55
Established
41 uptonking/note4yaoo

daily notes

54
Established
42 scribeocr/scribeocr

Web interface for recognizing text, proofreading OCR, and creating...

53
Established
43 bzsanti/oxidizePdf

a PDF library for rust

53
Established
44 UB-Mannheim/escriptorium

Clone of https://gitlab.com/scripta/escriptorium.git with updates from UB Mannheim

53
Established
45 arvindrajan92/DTrOCR

A PyTorch implementation of DTrOCR: Decoder-only Transformer for Optical...

53
Established
46 ecdye/macSubtitleOCR

Convert bitmap subtitles into SubRip format using the macOS Vision framework

53
Established
47 rtr46/meikipop

meikipop - universal japanese ocr popup dictionary for windows, linux and macos

51
Established
48 mindee/mindee-api-nodejs

Mindee API Helper Library for Node.js

51
Established
49 ocrbase-hq/ocrbase

📄 PDF ->.MD/.JSON API & SDK for PaddleOCR-VL with structured data...

51
Established
50 danpla/dpscreenocr

A program to recognize text on the screen

51
Established
51 marieai/marie-ai

Complex data extraction and orchestration framework designed for processing...

51
Established
52 GeiserX/paperless-telegram-bot

Manage Paperless-NGX documents entirely through Telegram - upload, search,...

51
Established
53 scribeocr/scribe.js

JavaScript OCR and text extraction for images and PDFs.

51
Established
54 Roots-Automation/GutenOCR

Open-source tools for training and evaluating Vision Language Models for OCR

50
Established
55 R0Wi-DEV/workflow_ocr

This is a Nextcloud Workflow App which enables you to process files via OCR...

50
Established
56 Purfview/InpaintDelogo

Advanced delogo plugin for AviSynth+

50
Established
57 doo/scanbot-sdk-example-flutter

Easy-to-use Flutter document scanner and data extraction plugin

50
Established
58 ZANdewanai/Genshin-Impact-Rich-Presence

Genshin Impact Rich Presence

50
Established
59 NitishKumar-ai/PersonalLearningPro

PersonalLearningPro is an open‑source, AI-powered school learning platform...

50
Established
60 wxyhgk/retain-pdf

在保留版面、公式与结构的前提下进行 PDF 翻译,适用于科研与技术文档

50
Established
61 wolfmanstout/screen-ocr

Easily perform OCR on portions of the screen, choosing from a selection of backends.

50
Established
62 OuterSpaceHobo/ScanLingua

Free open source chrome extension for immersive japanese, chinese and korean...

49
Emerging
63 dills122/MTG-Card-Analyzer

(🚧) Analyze images of MTG cards (Clean up in progress)

48
Emerging
64 SnapXL/SnapX

SnapX is a free, open-source, cross-platform tool that lets you capture or...

48
Emerging
65 workoss/boot

常用的工具或者服务

48
Emerging
66 yomihon/yomihon

Free and open source manga reader for Android - now with OCR (text recognition)

46
Emerging
67 monchin/tablers

A blazingly fast PDF table extraction library with python API powered by Rust

46
Emerging
68 CarvingIT/smart-repository

Institutional Repository of Knowledge

46
Emerging
69 yash2974/Zenpark

🔐 Zenpark – Smart Parking Management System Zenpark is an AI-powered smart...

45
Emerging
70 ZingYao/autogo_scriptengine

AutoGo 脚本引擎扩展方案 - 为 AutoGo 提供 JavaScript 和 Lua 双引擎支持,包含 20+...

45
Emerging
71 zai-org/GLM-skills

Official skills for the GLM family of models.

45
Emerging
72 run-llama/llama-cloud-ts

Typescript SDK for OCR and document parsing in the cloud with LlamaParse

45
Emerging
73 thomaswantstobeaskeleton/BallonsTranslator-Pro

BallonsTranslatorPro — Community fork of BallonsTranslator....

45
Emerging
74 nordie92/AoE4BO

Age of Empires 4 overlay to lern build orders

45
Emerging
75 veryfi/veryfi-nodejs

Node.js module for communicating with the Veryfi OCR API. || read:...

45
Emerging
76 openva/rs-video-processor

The video OCR processor for Richmond Sunlight.

44
Emerging
77 doo/scanbot-sdk-example-ios

Easy-to-use iOS document scanner and data extraction library for native iOS apps

44
Emerging
78 mindee/mindee-api-ruby

Mindee API Helper Library for Ruby

44
Emerging
79 Ronin-CK/QuickSnip

⚡ Lightweight Wayland OCR & Google Lens utility built with Quickshell.

43
Emerging
80 ieasybooks/tahweel.rb

تحويل ملفات PDF إلى TXT و DOCX و JSON

43
Emerging
81 jasperdevs/yoink

Free, open-source screenshot tool. Capture, annotate, and share with a single hotkey.

42
Emerging
82 firecrawl/pdf-inspector

Fast Rust library for PDF inspection, classification, and text extraction....

42
Emerging
83 Cilda/UmaUmaChecker

ウマ娘で選択肢のステータスを画像認識によって可視化する

42
Emerging
84 NirKli/WattBot

Smart OCR app for reading electricity meters from images

42
Emerging
85 simpledms/simpledms

Document management for small businesses.

41
Emerging
86 doo/scanbot-sdk-maui-example

Easy-to-use .NET MAUI document scanner and data extraction library

41
Emerging
87 regulaforensics/DocumentReader-web-js-client

Regula Document Reader web API js client for the browser and node.js based on axios

41
Emerging
88 prabesh704/ImageOCR

🖼️ Build an offline image search engine that indexes images and text for...

41
Emerging
89 SeseydOw/Captcha-Bypass-Tool

pentesting tool to bypass captcha Steam, Gmail, Instagram, Facebook,...

40
Emerging
90 Sixt/tensorlake-go

Go SDK for the Tensorlake API: document intelligence, cloud sandboxes, PTY...

40
Emerging
91 Agions/HardSubX

A professional video hard subtitle extraction tool with OCR. Extract...

40
Emerging
92 veryfi/veryfi-lens-react-native-demo

Example Demo App about how to use our react native wrapper

39
Emerging
93 Akronae/windows_media_ocr_cli

🔎 OCR CLI that outputs structured data with bounding rects using local...

38
Emerging
94 PELock/Dekoder-AZTEC-2D-JavaScript

Dekoder Kodu AZTEC 2D z Dowodu Rejestracyjnego dla JavaScript i Node.js (Web...

37
Emerging
95 1003129155/jietuba

A screenshot ocr and clipboard manager, available only on Windows 10+. ...

37
Emerging
96 abishekgiri/boring-ai

Self-hosted AI expense manager that turns receipts into structured data and...

36
Emerging
97 clark-labs-inc/pdfsink-rs

Fast pure-Rust PDF extraction library and CLI by Clark Labs Inc. — 10–50x...

36
Emerging
98 narzaut/translator

Translator Overlay

35
Emerging
99 hcmhcs/screenTranslate

Screen translation app for macOS — select any area, get instant translation....

35
Emerging
100 nonwill/nonwill.github.io

CDN data of www.autoptr.top.

35
Emerging