sebastianarnold/TeXoo
TeXoo – A Zoo of Text Extractors
This project automatically identifies and categorizes named entities, such as people, places, and organizations, in large volumes of text. It takes raw text documents as input and outputs the entities it finds, turning unstructured text into structured data. This is useful for data analysts, researchers, or anyone who needs to extract specific facts from text.
No commits in the last 6 months.
Use this if you need a robust, deep learning-based tool to automatically find and classify named entities across many text documents.
Not ideal if you are uncomfortable with command-line tools or with integrating Java libraries into existing software; the project is designed for technical users.
Stars: 18
Forks: 3
Language: Java
License: Apache-2.0
Category:
Last pushed: Jun 02, 2020
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/nlp/sebastianarnold/TeXoo"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
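For scripted access, the same request can be made from Python's standard library. This is a minimal sketch, not an official client. The helper names `build_quality_url` and `fetch_quality` are hypothetical, and the path layout (category, then owner, then repository) is an assumption inferred from the single curl example above. The response format is not documented here, so the sketch returns the raw body without parsing it.

```python
from urllib.parse import quote
from urllib.request import urlopen

# Base path copied from the curl example; everything after it is assumed
# to follow the pattern <category>/<owner>/<repo>.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def build_quality_url(category: str, owner: str, repo: str) -> str:
    """Build the endpoint URL for one repository (hypothetical helper)."""
    parts = (quote(part, safe="") for part in (category, owner, repo))
    return BASE_URL + "/" + "/".join(parts)


def fetch_quality(category: str, owner: str, repo: str, timeout: float = 10.0) -> str:
    """Fetch the endpoint without an API key and return the raw response body."""
    url = build_quality_url(category, owner, repo)
    with urlopen(url, timeout=timeout) as response:
        # Fall back to UTF-8 if the server does not declare a charset.
        charset = response.headers.get_content_charset() or "utf-8"
        return response.read().decode(charset)
```

Calling `fetch_quality("nlp", "sebastianarnold", "TeXoo")` requests the same URL as the curl command. Keyless access is limited to 100 requests per day, so a script that polls many repositories may want to cache the results.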
Higher-rated alternatives
maximtrp/bitermplus
Biterm Topic Model (BTM): modeling topics in short texts
stephenhky/PyShortTextCategorization
Various Algorithms for Short Text Mining
clips/pattern
Web mining module for Python, with tools for scraping, natural language processing, machine...
Hassaan-Elahi/Writing-Styles-Classification-Using-Stylometric-Analysis
✍️ An intelligent system that takes a document and classifies different writing styles within...
eimg/burmese-text-classifier
A neural network based text classification system for Burmese