richardpaulhudson/holmes-extractor
Information extraction from English and German texts based on predicate logic
This tool helps extract specific pieces of information or categorize content from English and German text documents, like articles, reports, or customer messages. You provide the text and define what you're looking for using clear search phrases or example documents, and it pulls out relevant details or classifies the document's topic. This is ideal for analysts, researchers, or anyone needing to systematically organize or retrieve facts from large text collections.
143 stars. No commits in the last 6 months. Available on PyPI.
Use this if you need to automatically find specific facts within texts or group documents by their core meaning, such as identifying all mentions of company mergers or categorizing customer feedback by issue.
Not ideal if your main goal is simple keyword searching or if you primarily work with unstructured data that isn't sentence-based, like images or audio.
Stars
143
Forks
13
Language
Python
License
MIT
Category
Last pushed
Jun 06, 2023
Commits (30d)
0
Dependencies
3
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/nlp/richardpaulhudson/holmes-extractor"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
chakki-works/seqeval
A Python framework for sequence labeling evaluation(named-entity recognition, pos tagging, etc...)
Hironsan/anago
Bidirectional LSTM-CRF and ELMo for Named-Entity Recognition, Part-of-Speech Tagging and so on.
jbesomi/texthero
Text preprocessing, representation and visualization from zero to hero.
hamelsmu/ktext
Utilities for preprocessing text for deep learning with Keras
asahi417/tner
Language model fine-tuning on NER with an easy interface and cross-domain evaluation. "T-NER: An...