bytefer/ollama-ocr

Implementing OCR with a local visual model run by ollama.

Score: 50 / 100 (Established)

This tool extracts text from images while preserving the original layout and formatting of that text. It takes a JPG, JPEG, or PNG image as input and outputs the recognized text, optionally as Markdown. It is aimed at developers and technical users who need to automate text extraction from documents, photos, or scanned materials.

300 stars. No commits in the last 6 months. Available on npm.

Use this if you need structured text from images, want the vision model to run locally through Ollama, and have the technical skills to integrate a Node.js library.

Not ideal if you're looking for a simple, plug-and-play OCR application without any programming or if you need to process file formats other than JPG, JPEG, or PNG.
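To make the image-in, text-out flow concrete, here is a minimal sketch of OCR against a locally running Ollama server. It does not use the ollama-ocr package's own API, which this listing does not document. Instead it calls Ollama's `/api/generate` endpoint directly. The endpoint URL (Ollama's default port), the model name `llama3.2-vision`, the prompts, and the helper names `isSupportedImage`, `buildOcrRequest`, and `ocrImage` are all assumptions chosen for illustration.

```typescript
import { readFile } from "node:fs/promises";
import { extname } from "node:path";

// Input formats named in the listing above.
const SUPPORTED_EXTENSIONS = new Set([".jpg", ".jpeg", ".png"]);

// Assumptions, not taken from the listing: Ollama's default local
// endpoint and an example vision model that has already been pulled.
const OLLAMA_URL = "http://localhost:11434/api/generate";
const MODEL = "llama3.2-vision";

// Returns true when the file extension is one of the supported formats.
export function isSupportedImage(filePath: string): boolean {
  return SUPPORTED_EXTENSIONS.has(extname(filePath).toLowerCase());
}

// Builds the JSON body for a non-streaming /api/generate call.
// The prompts are illustrative; a real tool would likely tune them.
export function buildOcrRequest(imageBase64: string, asMarkdown: boolean) {
  const prompt = asMarkdown
    ? "Transcribe all text in this image as Markdown, preserving headings, lists, and tables."
    : "Transcribe all text in this image exactly, preserving line breaks.";
  return { model: MODEL, prompt, images: [imageBase64], stream: false };
}

// Reads the image, sends it to the local model, and returns the text.
export async function ocrImage(filePath: string, asMarkdown = false): Promise<string> {
  if (!isSupportedImage(filePath)) {
    throw new Error(`Unsupported image format: ${filePath}`);
  }
  const imageBase64 = (await readFile(filePath)).toString("base64");
  const res = await fetch(OLLAMA_URL, {
    method: "POST",
    headers: { "Content-Type": "application/json" },
    body: JSON.stringify(buildOcrRequest(imageBase64, asMarkdown)),
  });
  if (!res.ok) {
    throw new Error(`Ollama request failed with status ${res.status}`);
  }
  const data = (await res.json()) as { response: string };
  return data.response;
}
```

Calling `await ocrImage("./scan.png", true)` would return Markdown text, provided Ollama is running locally and the assumed model is installed. The extension check mirrors the JPG, JPEG, and PNG restriction stated above.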

document-processing data-extraction image-to-text content-digitization visual-ai
Stale 6m
Maintenance 0 / 25
Adoption 10 / 25
Maturity 25 / 25
Community 15 / 25

Stars: 300
Forks: 28
Language: TypeScript
License: MIT
Last pushed: Nov 27, 2024
Commits (30d): 0
Dependencies: 1

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/transformers/bytefer/ollama-ocr"

Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000 requests/day.
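The same request can be made from Node.js. The sketch below assumes Node 18 or later (for the built-in `fetch`) and makes no claim about the response schema, which this page does not describe, so the result is typed as `unknown`. The helper names `qualityUrl` and `fetchQuality` are hypothetical.

```typescript
// Base path taken from the curl example above.
const API_BASE = "https://pt-edge.onrender.com/api/v1/quality";

// Builds the quality endpoint URL for a repository in a given category.
export function qualityUrl(category: string, owner: string, repo: string): string {
  const path = [category, owner, repo]
    .map((part) => encodeURIComponent(part))
    .join("/");
  return `${API_BASE}/${path}`;
}

// Fetches the quality record without an API key. The JSON shape is not
// documented on this page, so the caller must inspect the result.
export async function fetchQuality(
  category: string,
  owner: string,
  repo: string,
): Promise<unknown> {
  const res = await fetch(qualityUrl(category, owner, repo));
  if (!res.ok) {
    throw new Error(`Quality API request failed with status ${res.status}`);
  }
  return res.json();
}
```

For example, `await fetchQuality("transformers", "bytefer", "ollama-ocr")` requests the same URL as the curl command.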