oidlabs-com/Lexoid

Multimodal document parser for high-quality data understanding and extraction

48 / 100 (Emerging)

Lexoid uses AI models to extract clean, structured text from documents such as PDFs and web pages, including ones with complex layouts. You supply a document and receive text that is ready for analysis or further processing. It is aimed at people who regularly pull information from large volumes of documents, such as researchers, legal professionals, and data analysts.

Use this if you need to reliably convert diverse documents (PDFs, web pages) into well-structured text, especially when dealing with complex layouts or multimodal content.

Not ideal if you only need basic text extraction from simple, text-only files or if you require fine-grained control over layout preservation for visual reproduction.

document-processing data-extraction content-analysis information-retrieval research-automation
No Package, No Dependents
Maintenance 10 / 25
Adoption 9 / 25
Maturity 16 / 25
Community 13 / 25


Stars: 96
Forks: 11
Language: Python
License: Apache-2.0
Last pushed: Mar 12, 2026
Commits (30d): 0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/nlp/oidlabs-com/Lexoid"

Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000/day.
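
The same request can be made from Python. The following is a minimal sketch using only the standard library, not an official client. It assumes the three path segments after `/quality/` are a category, a repository owner, and a repository name (inferred from the URL above), and that the endpoint returns a JSON body; neither is confirmed by this page. The names `quality_url` and `fetch_quality` are hypothetical helpers introduced for illustration.

```python
import json
import urllib.request
from urllib.parse import quote

# Base path taken from the curl command above.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def quality_url(category: str, owner: str, repo: str) -> str:
    """Build the endpoint URL, percent-encoding each path segment.

    Assumption: the segments are category/owner/repo, as the
    example URL "nlp/oidlabs-com/Lexoid" suggests.
    """
    parts = [quote(part, safe="") for part in (category, owner, repo)]
    return BASE_URL + "/" + "/".join(parts)


def fetch_quality(category: str, owner: str, repo: str, timeout: float = 10.0):
    """Send a GET request and parse the response.

    Assumption: the response body is JSON. If it is not,
    json.loads raises a ValueError (json.JSONDecodeError).
    """
    request = urllib.request.Request(
        quality_url(category, owner, repo),
        headers={"Accept": "application/json"},
    )
    with urllib.request.urlopen(request, timeout=timeout) as response:
        body = response.read().decode("utf-8")
    return json.loads(body)
```

Calling `fetch_quality("nlp", "oidlabs-com", "Lexoid")` issues the same request as the curl command. Because unauthenticated access is limited to 100 requests/day, it may be worth caching the result locally rather than fetching it on every run.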