Uncategorized Document AI Tools

There are 154 uncategorized tools tracked. 4 score above 70 (verified tier). The highest-rated is opendatalab/MinerU at 80/100 with 59,166 stars. 8 of the top 10 are actively maintained.

Get all 154 projects as JSON

curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=document-ai&subcategory=uncategorized&limit=20"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.

# Tool Score Tier
1 opendatalab/MinerU

Transforms complex documents like PDFs into LLM-ready markdown/JSON for your...

80
Verified
2 mehmet-kozan/pdf-parse

Pure TypeScript, cross-platform module for extracting text, images, and...

76
Verified
3 HIllya51/LunaTranslator

视觉小说翻译器 / Visual Novel Translator

71
Verified
4 ShareX/ShareX

ShareX is a free and open-source application that enables users to capture...

71
Verified
5 btwld/docling-sdk

A TypeScript SDK for Docling - Bridge between the Python Docling ecosystem...

69
Established
6 STranslate/STranslate

A ready-to-go translation ocr tool developed with WPF/WPF 开发的一款即用即走的翻译、OCR工具

69
Established
7 tisfeng/Easydict

一个简洁优雅的词典翻译 macOS App。开箱即用,支持离线 OCR 识别,支持有道词典,🍎 苹果系统词典,🍎...

68
Established
8 zclucas/RMT

RMT (RuoMengTu) is a free, open-source macro tool built on AHKv2. Let the...

68
Established
9 readur/readur

Quick, painless, intuitive OCR platform written in Rust and TypeScript....

68
Established
10 pymupdf/PyMuPDF

PyMuPDF is a high performance Python library for data extraction, analysis,...

68
Established
11 run-llama/llama-cloud-py

Python SDK for OCR and document parsing in the cloud with LlamaParse

67
Established
12 TheJoeFin/Text-Grab

Use OCR in Windows quickly and easily with Text Grab. With optional...

67
Established
13 docling-project/docling

Get your documents ready for gen AI

67
Established
14 ocrmypdf/OCRmyPDF

OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched

67
Established
15 RapidAI/RapidOCR

📄 Awesome OCR multiple programing languages toolkits based on ONNX Runtime,...

66
Established
16 bpwhelan/GameSentenceMiner

An immersion toolkit for learning Languages through games and other visual media.

65
Established
17 datalab-to/chandra

OCR model that handles complex tables, forms, handwriting with full layout.

65
Established
18 xushengfeng/eSearch

截屏 离线OCR 搜索翻译 以图搜图 贴图 录屏 万向滚动截屏 屏幕翻译 Screenshot Offline OCR Search ...

65
Established
19 run-llama/liteparse

A fast, helpful, and open-source document parser

64
Established
20 zai-org/GLM-OCR

GLM-OCR: Accurate × Fast × Comprehensive

64
Established
21 pytr-org/pytr

Use TradeRepublic in terminal and mass download all documents

64
Established
22 CCExtractor/ccextractor

CCExtractor - Official version maintained by the core team

64
Established
23 felipeall/resumeio-to-pdf

Download your resume from resume.io as PDF

63
Established
24 mittagessen/kraken

OCR engine for all the languages

62
Established
25 seanghay/sone

Declarative Canvas layout engine for JavaScript with advanced rich text support.

60
Established
26 ballerine-io/ballerine

Open-source infrastructure and data orchestration platform for risk decisioning

60
Established
27 thanhkeke97/RSTGameTranslation

🎮 Real-time Game Translation Tool | OCR + AI Translation | Windows Gaming |...

59
Established
28 hankei6km/gas-gocr2notion

Google Drive で OCR を行い、結果を Notion データベースへ送信する Google Apps Script ライブラリー。

59
Established
29 formkiq/formkiq-core

Open-source document management platform leveraging AWS managed services....

58
Established
30 Achno/gowall

A tool to convert a Wallpaper's color scheme / palette, OCR with VLM's...

58
Established
31 RapidAI/RapidDoc

A high-performance, open-source PDF data extraction tool. ...

58
Established
32 ArtifexSoftware/mupdf.js

JavaScript bindings for MuPDF

58
Established
33 meangrinch/MangaTranslator

Manga translation app powered by AI

57
Established
34 oomol-lab/pdf-craft

PDF craft can convert PDF files into various other formats. This project...

57
Established
35 TareHimself/manga-translator

A manga translator built with python

56
Established
36 shibing624/imgocr

Python3 package for Chinese/English OCR,use paddleocr-v5 onnx model(~20MB),...

56
Established
37 zibo-chen/rust-paddle-ocr

高性能OCR识别库,支持上百种语言,提供命令行、图形界面及C API多种调用方式,使用便捷高效。 High-performance OCR...

56
Established
38 dynobo/normcap

OCR powered screen-capture tool to capture information instead of images

55
Established
39 wikimedia/wikimedia-ocr

API wrapper enabling Wikisources to submit images for optical character recognition.

55
Established
40 ispras/dedoc

Dedoc is a library (service) for automate documents parsing and bringing to...

55
Established
41 uptonking/note4yaoo

daily notes

54
Established
42 scribeocr/scribeocr

Web interface for recognizing text, proofreading OCR, and creating...

53
Established
43 bzsanti/oxidizePdf

a PDF library for rust

53
Established
44 UB-Mannheim/escriptorium

Clone of https://gitlab.com/scripta/escriptorium.git with updates from UB Mannheim

53
Established
45 arvindrajan92/DTrOCR

A PyTorch implementation of DTrOCR: Decoder-only Transformer for Optical...

53
Established
46 ecdye/macSubtitleOCR

Convert bitmap subtitles into SubRip format using the macOS Vision framework

53
Established
47 rtr46/meikipop

meikipop - universal japanese ocr popup dictionary for windows, linux and macos

51
Established
48 mindee/mindee-api-nodejs

Mindee API Helper Library for Node.js

51
Established
49 ocrbase-hq/ocrbase

📄 PDF ->.MD/.JSON API & SDK for PaddleOCR-VL with structured data...

51
Established
50 danpla/dpscreenocr

A program to recognize text on the screen

51
Established
51 marieai/marie-ai

Complex data extraction and orchestration framework designed for processing...

51
Established
52 GeiserX/paperless-telegram-bot

Manage Paperless-NGX documents entirely through Telegram - upload, search,...

51
Established
53 scribeocr/scribe.js

JavaScript OCR and text extraction for images and PDFs.

51
Established
54 Roots-Automation/GutenOCR

Open-source tools for training and evaluating Vision Language Models for OCR

50
Established
55 R0Wi-DEV/workflow_ocr

This is a Nextcloud Workflow App which enables you to process files via OCR...

50
Established
56 Purfview/InpaintDelogo

Advanced delogo plugin for AviSynth+

50
Established
57 doo/scanbot-sdk-example-flutter

Easy-to-use Flutter document scanner and data extraction plugin

50
Established
58 ZANdewanai/Genshin-Impact-Rich-Presence

Genshin Impact Rich Presence

50
Established
59 NitishKumar-ai/PersonalLearningPro

PersonalLearningPro is an open‑source, AI-powered school learning platform...

50
Established
60 wxyhgk/retain-pdf

在保留版面、公式与结构的前提下进行 PDF 翻译,适用于科研与技术文档

50
Established
61 wolfmanstout/screen-ocr

Easily perform OCR on portions of the screen, choosing from a selection of backends.

50
Established
62 OuterSpaceHobo/ScanLingua

Free open source chrome extension for immersive japanese, chinese and korean...

49
Emerging
63 dills122/MTG-Card-Analyzer

(🚧) Analyze images of MTG cards (Clean up in progress)

48
Emerging
64 SnapXL/SnapX

SnapX is a free, open-source, cross-platform tool that lets you capture or...

48
Emerging
65 workoss/boot

常用的工具或者服务

48
Emerging
66 yomihon/yomihon

Free and open source manga reader for Android - now with OCR (text recognition)

46
Emerging
67 monchin/tablers

A blazingly fast PDF table extraction library with python API powered by Rust

46
Emerging
68 CarvingIT/smart-repository

Institutional Repository of Knowledge

46
Emerging
69 yash2974/Zenpark

🔐 Zenpark – Smart Parking Management System Zenpark is an AI-powered smart...

45
Emerging
70 ZingYao/autogo_scriptengine

AutoGo 脚本引擎扩展方案 - 为 AutoGo 提供 JavaScript 和 Lua 双引擎支持,包含 20+...

45
Emerging
71 zai-org/GLM-skills

Official skills for the GLM family of models.

45
Emerging
72 run-llama/llama-cloud-ts

Typescript SDK for OCR and document parsing in the cloud with LlamaParse

45
Emerging
73 thomaswantstobeaskeleton/BallonsTranslator-Pro

BallonsTranslatorPro — Community fork of BallonsTranslator....

45
Emerging
74 nordie92/AoE4BO

Age of Empires 4 overlay to lern build orders

45
Emerging
75 veryfi/veryfi-nodejs

Node.js module for communicating with the Veryfi OCR API. || read:...

45
Emerging
76 openva/rs-video-processor

The video OCR processor for Richmond Sunlight.

44
Emerging
77 doo/scanbot-sdk-example-ios

Easy-to-use iOS document scanner and data extraction library for native iOS apps

44
Emerging
78 mindee/mindee-api-ruby

Mindee API Helper Library for Ruby

44
Emerging
79 Ronin-CK/QuickSnip

⚡ Lightweight Wayland OCR & Google Lens utility built with Quickshell.

43
Emerging
80 ieasybooks/tahweel.rb

تحويل ملفات PDF إلى TXT و DOCX و JSON

43
Emerging
81 jasperdevs/yoink

Free, open-source screenshot tool. Capture, annotate, and share with a single hotkey.

42
Emerging
82 firecrawl/pdf-inspector

Fast Rust library for PDF inspection, classification, and text extraction....

42
Emerging
83 Cilda/UmaUmaChecker

ウマ娘で選択肢のステータスを画像認識によって可視化する

42
Emerging
84 NirKli/WattBot

Smart OCR app for reading electricity meters from images

42
Emerging
85 simpledms/simpledms

Document management for small businesses.

41
Emerging
86 doo/scanbot-sdk-maui-example

Easy-to-use .NET MAUI document scanner and data extraction library

41
Emerging
87 regulaforensics/DocumentReader-web-js-client

Regula Document Reader web API js client for the browser and node.js based on axios

41
Emerging
88 prabesh704/ImageOCR

🖼️ Build an offline image search engine that indexes images and text for...

41
Emerging
89 SeseydOw/Captcha-Bypass-Tool

pentesting tool to bypass captcha Steam, Gmail, Instagram, Facebook,...

40
Emerging
90 Sixt/tensorlake-go

Go SDK for the Tensorlake API: document intelligence, cloud sandboxes, PTY...

40
Emerging
91 Agions/HardSubX

A professional video hard subtitle extraction tool with OCR. Extract...

40
Emerging
92 veryfi/veryfi-lens-react-native-demo

Example Demo App about how to use our react native wrapper

39
Emerging
93 Akronae/windows_media_ocr_cli

🔎 OCR CLI that outputs structured data with bounding rects using local...

38
Emerging
94 PELock/Dekoder-AZTEC-2D-JavaScript

Dekoder Kodu AZTEC 2D z Dowodu Rejestracyjnego dla JavaScript i Node.js (Web...

37
Emerging
95 1003129155/jietuba

A screenshot ocr and clipboard manager, available only on Windows 10+. ...

37
Emerging
96 abishekgiri/boring-ai

Self-hosted AI expense manager that turns receipts into structured data and...

36
Emerging
97 clark-labs-inc/pdfsink-rs

Fast pure-Rust PDF extraction library and CLI by Clark Labs Inc. — 10–50x...

36
Emerging
98 narzaut/translator

Translator Overlay

35
Emerging
99 hcmhcs/screenTranslate

Screen translation app for macOS — select any area, get instant translation....

35
Emerging
100 nonwill/nonwill.github.io

CDN data of www.autoptr.top.

35
Emerging
101 dsebastien/obsidian-transcriber

Transcriber converts images in your Obsidian vault to Markdown using Ollama...

35
Emerging
102 TroniePh/SmartMacroAI

Advanced Windows & Web Automation Tool. Features: Record Clicks/Scrolls,...

35
Emerging
103 hiimmuc/OCR-Handwritten-equations-solver

Handwritten equation solver: OCR, DNN, Flask

35
Emerging
104 neanes/byzantine-chant-ocr

An OCR toolset for Byzantine chant notation

34
Emerging
105 doo/scanbot-sdk-example-capacitor-ionic

Easy-to-use Ionic and Capacitor document scanner and data extraction library...

34
Emerging
106 JimEverest/fastshot

Fastshot is a GenAI powered screenshot and annotation tool designed to...

34
Emerging
107 veryfi/veryfi-lens-receipts-android-demo

Example codes about how to use Veryfi Lens SDKs

33
Emerging
108 Ganymede-Bio/gridgulp

Automatically detect and extract tables from Excel, CSV, and text files.

33
Emerging
109 veryfi/veryfi-lens-long-receipts-android-demo

Example codes about how to use Veryfi Lens SDKs

33
Emerging
110 0pen-Sourcer/Complete-Utility-App

A versatile desktop app offering wide range of tools for media downloading,...

33
Emerging
111 abbasZaidi110/n8n-Parse-Invoices-Documents-with-Gemini-AI-OCR-and-Google-Sheets-Integration

📄 Streamline invoice processing by integrating n8n with Gemini AI OCR and...

32
Emerging
112 KyleDerZweite/spellbook

Self-hosted TCG collection manager with mobile scanning, OCR recognition,...

32
Emerging
113 rampaa/Tsukikage

Hover-based output sender for OwOCR results

32
Emerging
114 R0mb0/PDF_accessibility_fixer

Client-side tool to check and fix PDF accessibility. Analyze PDFs for text...

32
Emerging
115 veryfi/veryfi-lens-headless-receipts-android-demo

Example codes about how to use Veryfi Lens SDKs

31
Emerging
116 yelog/SnapTraTranslator

一款 macos 离线快速翻译软件

31
Emerging
117 doerfli/reeper

Recipe management web application - parse recipes from image using AI based OCR

31
Emerging
118 victorfu/snap-tray

SnapTray is a tray-native screenshot and recording tool for macOS and...

31
Emerging
119 veryfi/veryfi-lens-barcodes-android-demo

Example about how to user Veryfi Lens for Barcodes

31
Emerging
120 cyanyux/pdf-ocr

Self-hosted GPU-accelerated OCR web app — convert scanned PDFs to searchable...

30
Emerging
121 iLejuxepWaduzd/structured-data-extractor

🛠️ Extract structured data from messy texts using Chain-of-Thought prompting...

30
Emerging
122 nxoti1/POINTS-Reader-OCR

🖥️ Extract text from images easily with POINTS-Reader OCR, a high-accuracy...

30
Emerging
123 rdantassilva/pdf2ocr

A CLI tool to apply OCR on PDF files and export to multiple formats

30
Emerging
124 DCC-BS/docling-glm-ocr

A docling plugin to integrate a remote hosted GLM-OCR OCR model into docling

29
Experimental
125 silenthillzeroq-code/clipnova

Windows clipboard manager that turns clipboard history into notes,...

29
Experimental
126 TanyaMushonga/skymarshal-api

Intelligent aerial traffic monitoring system featuring real-time vehicle...

28
Experimental
127 Sabastincruzz/Tools_DeepSeekOCR

🖥️ Deploy DeepSeek-OCR for Optical Character Recognition directly from...

28
Experimental
128 XUNRANA/LNU-LibSeat-Automation

🎯 辽宁大学图书馆自习室座位自动预约工具 | GUI 双击即用 · 验证码 OCR · 多账号并发 · 精准卡点 · 邮件通知 | Python + Selenium

28
Experimental
129 r-uben/socr

Multi-engine OCR with cascading fallback, quality audit, and figure extraction

28
Experimental
130 dkorbelainen/sniptext

screen text extractor with OCR and spell correction

27
Experimental
131 sw-willie-wu/MediaTranX

AI-powered local multimedia toolkit — speech-to-text, translation,...

27
Experimental
132 nikazzio/universal-iiif-studio

Modular tool for Digital Humanities: IIIF downloader + Studio environment....

27
Experimental
133 JordanCoin/openfoia

Local-first FOIA automation with AI-powered document analysis. Your data...

27
Experimental
134 hyperpolymath/presswerk

High-assurance local print router/server — Dioxus 0.7 mobile app with...

27
Experimental
135 Ajatt-Tools/lancet

OCR application for reading manga in Japanese, made for AJATTers 🇯🇵 .

27
Experimental
136 ieasybooks/tahweel-tauri

تحويل ملفات PDF إلى TXT و DOCX و JSON

27
Experimental
137 dimitar-radenkov/SnippingTool

Lightweight WPF screen capture, annotation, OCR and screen recorder for...

27
Experimental
138 MonDevHub/monocr

The MonOCR Platform: Academic-grade OCR for the Mon language....

26
Experimental
139 lzhgus/Capso

Open-source screenshot and screen recording for macOS. The free, native...

26
Experimental
140 arikusi/sahaf

Local PDF & EPUB to Markdown converter with OCR — runs on your hardware, no...

26
Experimental
141 qyinm/duckdocs

macOS app that parses PDF and Word documents into linked markdown packages using AI.

26
Experimental
142 run-llama/ParseBench

ParseBench - A Document Parsing Benchmark for AI Agents

26
Experimental
143 PaoloESAN/LN-Translator-Mobile

An Android app to translate Japanese Light Novels directly from your screen...

26
Experimental
144 XMuli/QuickUtilitiesSuite

Curated window utilities to boost your workflow (Quick ColorPicke, Quick...

25
Experimental
145 misraj-ai/kawn-python

The official Python SDK for kawn.ai by Misraj AI. High-performance Arabic...

25
Experimental
146 Lianye-Scythe/OCRTranslator

Portable Windows OCR / AI desktop tool with screenshot, selected-text, and...

25
Experimental
147 Alan5168/fapiao-clipper

发票夹子 - 本地大模型驱动的发票自动识别与报销管理工具(适配中国发票)

25
Experimental
148 oskarasadullin/speechma-api

Free, unlimited text-to-speech API with 486+ AI voices — unofficial Python...

25
Experimental
149 AntoC-dev/Recipedia

📱 A React Native recipe management app with OCR scanning, shopping lists,...

24
Experimental
150 veryfi/veryfi-lens-ocr-android-demo

Veryfi Lens OCR to read codes, numbers and short text

23
Experimental
151 duck-ai-yy/ex-ai

前任AI — 前任.skill 的零门槛替代。上传微信聊天记录,生成前任AI数字分身 Prompt。不用安装,打开网页就能用。

23
Experimental
152 TimLChan/steakcam

steakcam - Get notified when there is a 72oz steak challenge happening at...

22
Experimental
153 EdgeTypE/OldTurkicOCR

Pure-Rust Old Turkic (Gokturk) OCR engine powered by ResNet

18
Experimental
154 DevadattaP/math_to_latex

Converting handwritten mathematical expressions to LaTeX using...

18
Experimental