LaTeX OCR Tools ML Frameworks
Tools and models for converting images of mathematical equations, formulas, and technical diagrams into LaTeX code or markup. Includes handwritten and printed formula recognition, circuit diagram conversion, and speech-to-LaTeX translation. Does NOT include general document OCR, non-mathematical content recognition, or LaTeX editing/compilation tools.
There are 57 latex ocr tools frameworks tracked. 13 score above 50 (established tier). The highest-rated is ogkalu2/comic-translate at 69/100 with 2,445 stars. 5 of the top 10 are actively maintained.
Get all 57 projects as JSON
curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=ml-frameworks&subcategory=latex-ocr-tools&limit=20"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
| # | Framework | Score | Tier |
|---|---|---|---|
| 1 |
ogkalu2/comic-translate
Desktop app for automatically translating comics - BDs, Manga, Manhwa,... |
|
Established |
| 2 |
naptha/tesseract.js
Pure Javascript OCR for more than 100 Languages 📖🎉🖥 |
|
Established |
| 3 |
mayocream/koharu
ML-powered manga translator, written in Rust. |
|
Established |
| 4 |
tesseract-ocr/tesseract
Tesseract Open Source OCR Engine (main repository) |
|
Established |
| 5 |
mindspore-lab/mindocr
A toolbox of ocr models and algorithms based on MindSpore |
|
Established |
| 6 |
zyddnys/manga-image-translator
Translate manga/image 一键翻译各类图片内文字 https://cotrans.touhou.ai/ (no longer working) |
|
Established |
| 7 |
open-mmlab/mmocr
OpenMMLab Text Detection, Recognition and Understanding Toolbox |
|
Established |
| 8 |
dmMaze/BallonsTranslator
深度学习辅助漫画翻译工具, 支持一键机翻和简单的图像/文本编辑 | Yet another computer-aided comic/manga... |
|
Established |
| 9 |
lukas-blecher/LaTeX-OCR
pix2tex: Using a ViT to convert images of equations into LaTeX code. |
|
Established |
| 10 |
microsoft/OCR-Form-Tools
A set of tools to use in Microsoft Azure Form Recognizer and OCR services. |
|
Established |
| 11 |
LinXueyuanStdio/LaTeX_OCR_PRO
:art: 数学公式识别增强版:中英文手写印刷公式、支持初级符号推导(数据结构基于 LaTeX 抽象语法树)Math Formula OCR Pro,... |
|
Established |
| 12 |
VoxelCubes/PanelCleaner
An AI-powered tool to clean manga panels. |
|
Established |
| 13 |
LinXueyuanStdio/LaTeX_OCR
:gem: 数学公式识别 Math Formula OCR |
|
Established |
| 14 |
kingyiusuen/image-to-latex
Convert images of LaTex math equations into LaTex code. |
|
Emerging |
| 15 |
SakuraMathcraft/LaTeXSnipper
A powerful LaTeX formula recognition tool powered by pix2tex and pix2text. ... |
|
Emerging |
| 16 |
dmMaze/comic-text-detector
Manga&Comic text detection |
|
Emerging |
| 17 |
KUR-creative/SickZil-Machine
Manga/Comics Translation Helper Tool |
|
Emerging |
| 18 |
fh2019ustc/Awesome-Document-Image-Rectification
A comprehensive list of awesome document image rectification papers. |
|
Emerging |
| 19 |
AlibabaResearch/AdvancedLiterateMachinery
A collection of original, innovative ideas and algorithms towards Advanced... |
|
Emerging |
| 20 |
RQLuo/MixTeX-Latex-OCR
MixTeX multimodal LaTeX, ZhEn, and, Table OCR. It performs efficient... |
|
Emerging |
| 21 |
VikParuchuri/texify
Math OCR model that outputs LaTeX and markdown |
|
Emerging |
| 22 |
untrix/im2latex
Solution to im2latex request for research of openai |
|
Emerging |
| 23 |
YongWookHa/swin-transformer-ocr
swin-transformer custom for OCR |
|
Emerging |
| 24 |
bensonruan/Tesseract-OCR
Tesseract.js OCR |
|
Emerging |
| 25 |
juvian/Manga-Text-Segmentation
Segmentation of text in manga images |
|
Emerging |
| 26 |
jtl1207/comic-translation
基于深度学习的漫画翻译辅助工具,包含翻译、朗读、图像去字、自动嵌字功能。 目的是帮助非专业汉化人员完成更简单,快速的翻译任务。 |
|
Emerging |
| 27 |
ritheshkumar95/im2latex-tensorflow
Tensorflow implementation of the HarvardNLP paper - What You Get Is What You... |
|
Emerging |
| 28 |
stacksapien/react-tesseract-ocr
Tesseract OCR implementation in React JS |
|
Emerging |
| 29 |
XJF2332/GOT-OCR-2-GUI
GOT-OCR的GUI版本,提供OCR、导出PDF、批处理等功能,但不提供训练功能 |
|
Emerging |
| 30 |
Aeonss/BubbleBlaster
Bubble Blaster removes text from speech bubbles in mangas/manhwas, made for... |
|
Emerging |
| 31 |
vkit-x/vkit
Boosting Document Intelligence |
|
Emerging |
| 32 |
sitammeur/TextSnap
TextSnap: Demo for Florence 2 model used in OCR tasks to extract and... |
|
Emerging |
| 33 |
JeffersonQin/YuzuMarker
🍋 [WIP] Manga Translation Tool |
|
Emerging |
| 34 |
lizhaoliu-Lec/CCSE
Instance Segmentation for Chinese Character Stroke Extraction, Datasets and... |
|
Emerging |
| 35 |
tuanio/image2latex
Image to Latex using Encoder-Decoder architecture |
|
Emerging |
| 36 |
jackvial/tuatara
Tuatara: Deep Learning OCR Engine |
|
Emerging |
| 37 |
jasmine-dragons/VoTeX
Speech to LaTeX translator. LA Hacks 3rd place overall and Best Hack Using... |
|
Emerging |
| 38 |
CatUnderTheLeaf/musicScanner
Optical Music Recognition using Deep Learning |
|
Emerging |
| 39 |
tony-xlh/SynthMRZ
Code for generating synthetic MRZ images |
|
Experimental |
| 40 |
rn-snehapriya/Automatic-Note-Taking-From-Video-Using-Tesseract-OCR
Text from the video is extracted and saved into a .docx file in the form of notes. |
|
Experimental |
| 41 |
AdelRizq/Orchestra
Orchestra is a sheet music reader (optical music recognition (OMR) system)... |
|
Experimental |
| 42 |
chencxt/MoreMTQE
更多更易用的机器翻译质量评估(Machine Translation Quality Estimation)方案 |
|
Experimental |
| 43 |
nakamura196/koten-ocr-ios
KotenOCR — iOS app for OCR of classical and modern Japanese texts using NDL... |
|
Experimental |
| 44 |
olibridge01/TeXOCR
Optical Character Recognition (OCR) model for Image-to-LaTeX conversion |
|
Experimental |
| 45 |
endx707/tesseract
🖥️ Perform optical character recognition with Tesseract, an open-source tool... |
|
Experimental |
| 46 |
RQLuo/MixTeX-DataHub
LaTeXDataHub is an open-source platform dedicated to the sharing and... |
|
Experimental |
| 47 |
Muiz20/macula
Detect and correct OCR errors directly in the browser using a lightweight,... |
|
Experimental |
| 48 |
gnurt2041/MangaOCR
A lightweight OCR model for Japanese text, especially in Manga |
|
Experimental |
| 49 |
Aadv1k/OctetOCR
Octet is an exploratory OCR or text recognition library to prepare and train... |
|
Experimental |
| 50 |
Carath/TeXdrawer
Small tool for handwritten LaTeX symbols recognition. |
|
Experimental |
| 51 |
SlavaKuzkinHackathon/ScanTire-AI-Architecture
Technical architecture and ML pipeline overview for the ScanTire.com OCR &... |
|
Experimental |
| 52 |
Makena123456/Paper-Comicizer
📚 Transform academic PDFs into engaging Doraemon comics for easier... |
|
Experimental |
| 53 |
harishB97/Im2Latex-TensorFlow-2
TensorFlow-2 implementation of Im2Latex deep learning model described in... |
|
Experimental |
| 54 |
Ashishkumar-hub/image_to_text_ocr
Image to text conversion using tesserocr |
|
Experimental |
| 55 |
itsitgroup/ocr-streamlit-demo
This repository serves as a StreamLit frontend for Computer Vision based OCR... |
|
Experimental |
| 56 |
bensonruan/Ocrad.js
Ocrad.js - Harnessing the Power of Optical Character Recognition in JavaScript |
|
Experimental |
| 57 |
timothe-chaumont/circuit-to-latex
Deep-learning model that converts electrical circuits images to LaTeX diagrams. |
|
Experimental |