LaTeX OCR Tools ML Frameworks

Tools and models for converting images of mathematical equations, formulas, and technical diagrams into LaTeX code or markup. Includes handwritten and printed formula recognition, circuit diagram conversion, and speech-to-LaTeX translation. Does NOT include general document OCR, non-mathematical content recognition, or LaTeX editing/compilation tools.

There are 57 latex ocr tools frameworks tracked. 13 score above 50 (established tier). The highest-rated is ogkalu2/comic-translate at 69/100 with 2,445 stars. 5 of the top 10 are actively maintained.

Get all 57 projects as JSON

curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=ml-frameworks&subcategory=latex-ocr-tools&limit=20"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.

# Framework Score Tier
1 ogkalu2/comic-translate

Desktop app for automatically translating comics - BDs, Manga, Manhwa,...

69
Established
2 naptha/tesseract.js

Pure Javascript OCR for more than 100 Languages 📖🎉🖥

68
Established
3 mayocream/koharu

ML-powered manga translator, written in Rust.

62
Established
4 tesseract-ocr/tesseract

Tesseract Open Source OCR Engine (main repository)

61
Established
5 mindspore-lab/mindocr

A toolbox of ocr models and algorithms based on MindSpore

59
Established
6 zyddnys/manga-image-translator

Translate manga/image 一键翻译各类图片内文字 https://cotrans.touhou.ai/ (no longer working)

59
Established
7 open-mmlab/mmocr

OpenMMLab Text Detection, Recognition and Understanding Toolbox

58
Established
8 dmMaze/BallonsTranslator

深度学习辅助漫画翻译工具, 支持一键机翻和简单的图像/文本编辑 | Yet another computer-aided comic/manga...

57
Established
9 lukas-blecher/LaTeX-OCR

pix2tex: Using a ViT to convert images of equations into LaTeX code.

55
Established
10 microsoft/OCR-Form-Tools

A set of tools to use in Microsoft Azure Form Recognizer and OCR services.

53
Established
11 LinXueyuanStdio/LaTeX_OCR_PRO

:art: 数学公式识别增强版:中英文手写印刷公式、支持初级符号推导(数据结构基于 LaTeX 抽象语法树)Math Formula OCR Pro,...

50
Established
12 VoxelCubes/PanelCleaner

An AI-powered tool to clean manga panels.

50
Established
13 LinXueyuanStdio/LaTeX_OCR

:gem: 数学公式识别 Math Formula OCR

50
Established
14 kingyiusuen/image-to-latex

Convert images of LaTex math equations into LaTex code.

48
Emerging
15 SakuraMathcraft/LaTeXSnipper

A powerful LaTeX formula recognition tool powered by pix2tex and pix2text. ...

48
Emerging
16 dmMaze/comic-text-detector

Manga&Comic text detection

47
Emerging
17 KUR-creative/SickZil-Machine

Manga/Comics Translation Helper Tool

47
Emerging
18 fh2019ustc/Awesome-Document-Image-Rectification

A comprehensive list of awesome document image rectification papers.

46
Emerging
19 AlibabaResearch/AdvancedLiterateMachinery

A collection of original, innovative ideas and algorithms towards Advanced...

46
Emerging
20 RQLuo/MixTeX-Latex-OCR

MixTeX multimodal LaTeX, ZhEn, and, Table OCR. It performs efficient...

45
Emerging
21 VikParuchuri/texify

Math OCR model that outputs LaTeX and markdown

44
Emerging
22 untrix/im2latex

Solution to im2latex request for research of openai

44
Emerging
23 YongWookHa/swin-transformer-ocr

swin-transformer custom for OCR

44
Emerging
24 bensonruan/Tesseract-OCR

Tesseract.js OCR

44
Emerging
25 juvian/Manga-Text-Segmentation

Segmentation of text in manga images

42
Emerging
26 jtl1207/comic-translation

基于深度学习的漫画翻译辅助工具,包含翻译、朗读、图像去字、自动嵌字功能。 目的是帮助非专业汉化人员完成更简单,快速的翻译任务。

42
Emerging
27 ritheshkumar95/im2latex-tensorflow

Tensorflow implementation of the HarvardNLP paper - What You Get Is What You...

41
Emerging
28 stacksapien/react-tesseract-ocr

Tesseract OCR implementation in React JS

40
Emerging
29 XJF2332/GOT-OCR-2-GUI

GOT-OCR的GUI版本,提供OCR、导出PDF、批处理等功能,但不提供训练功能

40
Emerging
30 Aeonss/BubbleBlaster

Bubble Blaster removes text from speech bubbles in mangas/manhwas, made for...

40
Emerging
31 vkit-x/vkit

Boosting Document Intelligence

35
Emerging
32 sitammeur/TextSnap

TextSnap: Demo for Florence 2 model used in OCR tasks to extract and...

35
Emerging
33 JeffersonQin/YuzuMarker

🍋 [WIP] Manga Translation Tool

34
Emerging
34 lizhaoliu-Lec/CCSE

Instance Segmentation for Chinese Character Stroke Extraction, Datasets and...

33
Emerging
35 tuanio/image2latex

Image to Latex using Encoder-Decoder architecture

31
Emerging
36 jackvial/tuatara

Tuatara: Deep Learning OCR Engine

31
Emerging
37 jasmine-dragons/VoTeX

Speech to LaTeX translator. LA Hacks 3rd place overall and Best Hack Using...

31
Emerging
38 CatUnderTheLeaf/musicScanner

Optical Music Recognition using Deep Learning

31
Emerging
39 tony-xlh/SynthMRZ

Code for generating synthetic MRZ images

28
Experimental
40 rn-snehapriya/Automatic-Note-Taking-From-Video-Using-Tesseract-OCR

Text from the video is extracted and saved into a .docx file in the form of notes.

27
Experimental
41 AdelRizq/Orchestra

Orchestra is a sheet music reader (optical music recognition (OMR) system)...

26
Experimental
42 chencxt/MoreMTQE

更多更易用的机器翻译质量评估(Machine Translation Quality Estimation)方案

25
Experimental
43 nakamura196/koten-ocr-ios

KotenOCR — iOS app for OCR of classical and modern Japanese texts using NDL...

25
Experimental
44 olibridge01/TeXOCR

Optical Character Recognition (OCR) model for Image-to-LaTeX conversion

24
Experimental
45 endx707/tesseract

🖥️ Perform optical character recognition with Tesseract, an open-source tool...

22
Experimental
46 RQLuo/MixTeX-DataHub

LaTeXDataHub is an open-source platform dedicated to the sharing and...

21
Experimental
47 Muiz20/macula

Detect and correct OCR errors directly in the browser using a lightweight,...

21
Experimental
48 gnurt2041/MangaOCR

A lightweight OCR model for Japanese text, especially in Manga

21
Experimental
49 Aadv1k/OctetOCR

Octet is an exploratory OCR or text recognition library to prepare and train...

19
Experimental
50 Carath/TeXdrawer

Small tool for handwritten LaTeX symbols recognition.

18
Experimental
51 SlavaKuzkinHackathon/ScanTire-AI-Architecture

Technical architecture and ML pipeline overview for the ScanTire.com OCR &...

14
Experimental
52 Makena123456/Paper-Comicizer

📚 Transform academic PDFs into engaging Doraemon comics for easier...

14
Experimental
53 harishB97/Im2Latex-TensorFlow-2

TensorFlow-2 implementation of Im2Latex deep learning model described in...

12
Experimental
54 Ashishkumar-hub/image_to_text_ocr

Image to text conversion using tesserocr

11
Experimental
55 itsitgroup/ocr-streamlit-demo

This repository serves as a StreamLit frontend for Computer Vision based OCR...

11
Experimental
56 bensonruan/Ocrad.js

Ocrad.js - Harnessing the Power of Optical Character Recognition in JavaScript

10
Experimental
57 timothe-chaumont/circuit-to-latex

Deep-learning model that converts electrical circuits images to LaTeX diagrams.

10
Experimental