michaelmml/NLP-Information-Extraction

Automated PDF and text processing with Spacy and NLTK; information extraction from text based on grammatical structure; deployed on extracted raw search data

27
/ 100
Experimental

This tool helps researchers, analysts, or business intelligence professionals automatically process large volumes of text, such as company transcripts, patent documents, or news articles. It takes raw text or PDFs as input and extracts key information like topics, keywords, named entities (like company names), and significant phrases. The output helps you quickly understand content, identify trends, and summarize lengthy documents without manual review.

No commits in the last 6 months.

Use this if you need to quickly extract structured insights and key information from large unstructured text datasets like financial reports, legal documents, or industry news.

Not ideal if you need to perform sentiment analysis, question-answering, or generate new text rather than extract existing information.

market-research patent-analysis business-intelligence financial-analysis competitive-intelligence
Stale 6m No Package No Dependents
Maintenance 0 / 25
Adoption 6 / 25
Maturity 16 / 25
Community 5 / 25

How are scores calculated?

Stars

16

Forks

1

Language

Jupyter Notebook

License

MIT

Last pushed

Apr 01, 2022

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/nlp/michaelmml/NLP-Information-Extraction"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.