seanghay/KhmerOCR

A Fast Khmer Optical Character Recognition (KhmerOCR)

47
/ 100
Emerging

This tool helps individuals and organizations convert scanned Khmer documents, images, or PDFs into editable text formats like Word, HTML, Markdown, or plain text. It accurately recognizes Khmer script, detects different font styles (like Moul vs. Regular), and preserves document layouts. Anyone working with physical or digital Khmer documents who needs to extract and edit their content would find this useful.

Use this if you need to quickly and accurately convert images or PDF documents containing Khmer script into editable digital text.

Not ideal if your documents contain English or other non-Khmer languages, as those are not currently supported.

Khmer document processing document digitization data entry automation information extraction archival scanning
No Package No Dependents
Maintenance 10 / 25
Adoption 8 / 25
Maturity 13 / 25
Community 16 / 25

How are scores calculated?

Stars

48

Forks

9

Language

C++

License

MIT

Last pushed

Feb 12, 2026

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/nlp/seanghay/KhmerOCR"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.