nasa-jpl-memex/memex-gate
General Architecture for Text Engineering
This tool helps legal researchers and investigators analyze large collections of legal documents, news articles, and other texts. It takes various document formats as input—like court filings, press releases, or scanned PDFs—and identifies key information such as names, organizations, dates, and legal terms. The output is structured data that can be used for searching, visualizing connections, and understanding trends across vast archives of text.
No commits in the last 6 months.
Use this if you need to process and extract specific entities and concepts from a large volume of legal or domain-specific documents for research or investigative purposes.
Not ideal if you're looking for a simple text editor or a tool for basic document conversion without advanced linguistic analysis capabilities.
Stars
49
Forks
22
Language
Python
License
Apache-2.0
Category
Last pushed
Mar 23, 2016
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/nlp/nasa-jpl-memex/memex-gate"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
textvec/textvec
Text vectorization tool to outperform TFIDF for classification tasks
DigitalPebble/behemoth
Behemoth is an open source platform for large scale document analysis based on Apache Hadoop.
cooperability/BMX-bookmark-extractor
Better brain. Knowledge management tool. Stop saving things you'll never read. Work in progress.
NISH1001/tag-generator
A simple tool to generate tags for the given text (document) using TF-IDF.
paradite/tf-idf-keyword
:mag_right: Get keywords from a piece of text using tf-idf