naptha/tesseract.js

Pure JavaScript OCR for more than 100 languages 📖🎉🖥

Score: 68 / 100 (Established)

This tool helps you extract text from images, making it searchable and editable. You provide an image containing text, and it gives you the recognized text in over 100 languages. It's ideal for anyone who needs to quickly get text out of scanned documents, photos, or screenshots, without manually retyping.

37,920 stars. Used by 5 other packages. Available on npm.

Use this if you need to convert text embedded within images into a usable digital format for editing, searching, or archiving.

Not ideal if your project needs to extract text from PDF files, or needs OCR accuracy beyond what the core Tesseract engine provides.
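The image-in, text-out flow described above might look roughly like the following in Node.js. This is a minimal sketch, assuming the `createWorker` / `recognize` / `terminate` API of recent tesseract.js releases (v5 or later); `extractText` and its `workerFactory` parameter are hypothetical names introduced for this example and are not part of the library.

```javascript
// Sketch only: assumes the tesseract.js v5+ worker API.
// `extractText` and `workerFactory` are hypothetical names for illustration.
async function extractText(image, lang = "eng", workerFactory) {
  // Load the library lazily so a stand-in factory can be supplied instead.
  const createWorker = workerFactory || require("tesseract.js").createWorker;

  // Create a worker preloaded with the requested language model.
  const worker = await createWorker(lang);
  try {
    // `image` may be a file path, URL, or buffer accepted by the library.
    const { data } = await worker.recognize(image);
    return data.text;
  } finally {
    // Always release the worker, even if recognition throws.
    await worker.terminate();
  }
}
```

With the package installed from npm, a call such as `extractText("scan.png").then(console.log)` would print the recognized text; `scan.png` is a placeholder file name. Creating one worker per call keeps the sketch short, but reusing a worker across many images is likely cheaper.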

document-digitization data-entry information-retrieval content-extraction image-to-text
Maintenance 10 / 25
Adoption 15 / 25
Maturity 25 / 25
Community 18 / 25

Stars: 37,920
Forks: 2,363
Language: JavaScript
License: Apache-2.0
Category: latex-ocr-tools
Last pushed: Feb 28, 2026
Commits (30d): 0
Dependencies: 9
Reverse dependents: 5

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/ml-frameworks/naptha/tesseract.js"

Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000 requests/day.
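The same request can be made from JavaScript. The sketch below assumes Node.js 18 or later (for the built-in `fetch`) and takes only the base URL and the category/owner/repo path layout from the curl example above. The helper names `buildQualityUrl` and `fetchQuality` are hypothetical. Because the response format is not described here, the body is returned as raw text rather than parsed.

```javascript
// Base URL taken from the curl example; everything else is illustrative.
const BASE_URL = "https://pt-edge.onrender.com/api/v1/quality";

// Build the endpoint URL from a category, repository owner, and repository name.
function buildQualityUrl(category, owner, repo) {
  const path = [category, owner, repo].map(encodeURIComponent).join("/");
  return `${BASE_URL}/${path}`;
}

// Fetch the quality data as raw text (requires the global fetch of Node.js 18+).
async function fetchQuality(category, owner, repo) {
  const response = await fetch(buildQualityUrl(category, owner, repo));
  if (!response.ok) {
    // Any non-2xx status, including an exceeded daily quota, surfaces here.
    throw new Error(`Request failed with status ${response.status}`);
  }
  return response.text();
}
```

Calling `fetchQuality("ml-frameworks", "naptha", "tesseract.js")` requests the same URL as the curl command. How an API key would be passed is not shown in the source, so the sketch covers keyless access only.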