opensemanticsearch/open-semantic-search

Open Source research tool to search, browse, analyze and explore large document collections by Semantic Search Engine and Open Source Text Mining & Text Analytics platform (Integrates ETL for document processing, OCR for images & PDF, named entity recognition for persons, organizations & locations, metadata management by thesaurus & ontologies, search user interface & search apps for fulltext search, faceted search & knowledge graph)

51
/ 100
Established

This tool helps you quickly make sense of large collections of documents by transforming them into a searchable knowledge base. It takes various document types, including images and PDFs, extracts key information like names, organizations, and locations, and lets you search, browse, and analyze the content. This is ideal for researchers, analysts, or anyone who needs to explore vast amounts of unstructured text data efficiently.

1,154 stars. No commits in the last 6 months.

Use this if you need to process, search, and analyze extensive archives of documents, PDFs, or scanned images to uncover insights and relationships.

Not ideal if you only need a simple keyword search for a small number of already well-structured documents.

document-analysis research-intelligence information-retrieval knowledge-management text-mining
Stale 6m No Package No Dependents
Maintenance 2 / 25
Adoption 10 / 25
Maturity 16 / 25
Community 23 / 25

How are scores calculated?

Stars

1,154

Forks

196

Language

Shell

License

GPL-3.0

Last pushed

Apr 19, 2025

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/nlp/opensemanticsearch/open-semantic-search"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.