naptha/tesseract.js
Pure Javascript OCR for more than 100 Languages 📖🎉🖥
This tool helps you extract text from images, making it searchable and editable. You provide an image containing text, and it gives you the recognized text in over 100 languages. It's ideal for anyone who needs to quickly get text out of scanned documents, photos, or screenshots, without manually retyping.
37,920 stars. Used by 5 other packages. Available on npm.
Use this if you need to convert text embedded within images into a usable digital format for editing, searching, or archiving.
Not ideal if your project requires extracting text from PDF files or needs advanced improvements to OCR accuracy beyond the core Tesseract engine's capabilities.
Stars
37,920
Forks
2,363
Language
JavaScript
License
Apache-2.0
Category
Last pushed
Feb 28, 2026
Commits (30d)
0
Dependencies
9
Reverse dependents
5
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/ml-frameworks/naptha/tesseract.js"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Recent Releases
Related frameworks
ogkalu2/comic-translate
Desktop app for automatically translating comics - BDs, Manga, Manhwa, Fumetti and more in a...
mayocream/koharu
ML-powered manga translator, written in Rust.
tesseract-ocr/tesseract
Tesseract Open Source OCR Engine (main repository)
mindspore-lab/mindocr
A toolbox of ocr models and algorithms based on MindSpore
zyddnys/manga-image-translator
Translate manga/image 一键翻译各类图片内文字 https://cotrans.touhou.ai/ (no longer working)