sebastianarnold/TeXoo

TeXoo – A Zoo of Text Extractors

/ 100

Emerging

This project helps you automatically identify and categorize key pieces of information, like names of people, places, or organizations, within large volumes of text. You input raw text documents, and it outputs these identified entities, helping you structure unstructured data. This is useful for data analysts, researchers, or anyone needing to extract specific facts from text.

No commits in the last 6 months.

Use this if you need a robust, deep learning-based tool to automatically find and classify named entities across many text documents.

Not ideal if you are uncomfortable with command-line tools or with integrating Java libraries into existing software; it is designed for technical users.

text-analysis information-extraction named-entity-recognition data-mining document-processing

Stale 6m No Package No Dependents

Maintenance 0 / 25

Adoption 6 / 25

Maturity 16 / 25

Community 12 / 25

Language Java

License Apache-2.0

Higher-rated alternatives

maximtrp/bitermplus

Biterm Topic Model (BTM): modeling topics in short texts

stephenhky/PyShortTextCategorization

Various Algorithms for Short Text Mining

clips/pattern

Web mining module for Python, with tools for scraping, natural language processing, machine...

Hassaan-Elahi/Writing-Styles-Classification-Using-Stylometric-Analysis

✍️ An intelligent system that takes a document and classifies different writing styles within...

eimg/burmese-text-classifier

A neural network based text classification system for Burmese
