textocat/textokit-core
TextoKit - is a set of components for Natural Language Processing based on Apache UIMA platform.
TextoKit provides a collection of tools for analyzing and processing human language. It takes raw text, such as documents or articles, and can extract meaningful information, categorize content, or identify specific entities within the text. This is designed for engineers or researchers who are building applications that need to understand or work with large amounts of text data.
No commits in the last 6 months.
Use this if you are building a system that requires robust, modular natural language processing capabilities for tasks like information extraction, text classification, or content analysis.
Not ideal if you are looking for a ready-to-use application for text analysis and don't have programming experience to integrate components.
Stars
16
Forks
1
Language
Java
License
Apache-2.0
Category
Last pushed
Jul 06, 2016
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/nlp/textocat/textokit-core"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
apache/opennlp
Apache OpenNLP
stanfordnlp/CoreNLP
CoreNLP: A Java suite of core NLP tools for tokenization, sentence segmentation, NER, parsing,...
stanfordnlp/python-stanford-corenlp
Python interface to CoreNLP using a bidirectional server-client interface.
dkpro/dkpro-core
Collection of software components for natural language processing (NLP) based on the Apache UIMA...
apache/opennlp-sandbox
Apache OpenNLP Sandbox