Java NLP Frameworks NLP Tools

Comprehensive NLP libraries and frameworks for Java with core linguistic processing pipelines (tokenization, parsing, NER, etc.). Does NOT include language-specific tools, annotation platforms, single-task extractors, or non-Java implementations.

There are 137 java nlp frameworks tools tracked. 1 score above 70 (verified tier). The highest-rated is apache/opennlp at 74/100 with 1,586 stars. 1 of the top 10 are actively maintained.

Get all 137 projects as JSON

curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=nlp&subcategory=java-nlp-frameworks&limit=20"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.

# Tool Score Tier
1 apache/opennlp

Apache OpenNLP

74
Verified
2 stanfordnlp/CoreNLP

CoreNLP: A Java suite of core NLP tools for tokenization, sentence...

61
Established
3 stanfordnlp/python-stanford-corenlp

Python interface to CoreNLP using a bidirectional server-client interface.

59
Established
4 dkpro/dkpro-core

Collection of software components for natural language processing (NLP)...

59
Established
5 apache/opennlp-sandbox

Apache OpenNLP Sandbox

58
Established
6 FraBle/python-sutime

Python wrapper for Stanford CoreNLP's SUTime

56
Established
7 FraBle/python-duckling

Python wrapper for wit.ai's Duckling Clojure library

55
Established
8 proycon/folia

FoLiA: Format for Linguistic Annotation - FoLiA is a rich XML-based...

55
Established
9 proycon/colibri-core

Colibri core is an NLP tool as well as a C++ and Python library for working...

53
Established
10 gerardobort/node-corenlp

CoreNLP @ NodeJS

52
Established
11 NLPchina/nlp-lang

这个项目是一个基本包.封装了大多数nlp项目中常用工具

51
Established
12 CogComp/cogcomp-nlp

CogComp's Natural Language Processing Libraries and Demos: Modules include...

51
Established
13 zaibacu/rita-dsl

A Domain Specific Language (DSL) for building language patterns. These can...

49
Emerging
14 facebookarchive/duckling_old

Deprecated in favor of https://github.com/facebook/duckling

49
Emerging
15 stanfordnlp/phrasal

A large-scale statistical machine translation system written in Java.

49
Emerging
16 intersystems/iknow

Community development repository for iKnow

49
Emerging
17 LanguageMachines/libfolia

FoLiA library for C++

48
Emerging
18 neuml/txtai.java

Java client for txtai

47
Emerging
19 ardoco/core

Core framework for traceability link recovery between architecture...

46
Emerging
20 clojurenlp/core

Clojure wrapper for the Stanford CoreNLP Java library

45
Emerging
21 apache/incubator-nlpcraft

Apache NLPCraft - API to convert natural language into actions.

45
Emerging
22 tupilabs/HumanNameParser.java

HumanNameParser.java, a Java port of HumanNameParser.php. Parser for human...

45
Emerging
23 theopenconversationkit/tock-corenlp

Optional part of the Tock toolkit based on Stanford CoreNLP library

44
Emerging
24 yegor256/requs

Controlled Natural Language for Requirements Specifications, with Maven Plugin

43
Emerging
25 mojtaba-khallash/JHazm

A Java version of Hazm (Python library for digesting Persian text)

43
Emerging
26 apache/opennlp-models

Apache OpenNLP Models

42
Emerging
27 macournoyer/utterance_parser

Extract intent and entities from natural language utterances

41
Emerging
28 sea-boat/TextAnalyzer

A text analyzer which is based on machine learning,statistics and...

41
Emerging
29 termsuite/termsuite-core

A Java UIMA-based toolbox for multilingual and efficient terminology...

41
Emerging
30 tspannhw/nifi-nlp-processor

Apache NiFi NLP Processor

40
Emerging
31 turbopape/postagga

A Library to parse natural language in pure Clojure and ClojureScript

39
Emerging
32 leovan/duckling-chinese

中文 Duckling

39
Emerging
33 toltoxgh/CoreNLP-jMWE

Stanford CoreNLP annotator implementing jMWE for detecting Multi-Word...

39
Emerging
34 acoli-repo/olia

Ontologies of Linguistic Annotation. Machine-readable tagsets and annotation...

39
Emerging
35 Ruthwik/Language-Detection

Language Detection in Java using OpenNLP

38
Emerging
36 sunil3590/artificial-guy

Natural Language Processing + Knowledge Representation + Reasoning + Natural...

38
Emerging
37 xszhong/syntime

The Java source code and datasets of our ACL2017 paper: Time Expression...

37
Emerging
38 dbamman/book-nlp

Natural language processing pipeline for book-length documents (archival...

37
Emerging
39 sina-al/pynlp

A pythonic wrapper for Stanford CoreNLP.

37
Emerging
40 neo-search/multiple-string-searcher

Java library for fast multiple strings matchings. Uses internally...

37
Emerging
41 dhowe/rita4j

RiTa for Java

37
Emerging
42 eXist-db/exist-stanford-nlp

XQuery wrapper around the Stanford CoreNLP pipeline

36
Emerging
43 fquirin/java-nlu-tools

Java tools to do natural language processing like NER and intent...

36
Emerging
44 etiennesillon/ModelRunner

No-code, model driven, natural language data access platform

36
Emerging
45 manzurola/spaCy4j

Java client for spaCy and more.

36
Emerging
46 Bushra-KB/Amharic-NLP-Tools-in-JAVA

This repository contains implementations of various Natural Language...

36
Emerging
47 JULIELab/JeSemE

Jena Semantic Explorer

35
Emerging
48 quadrama/DramaNLP

UIMA NLP components for dramatic texts

35
Emerging
49 Sciss/jawjaw

Automatically exported from code.google.com/p/jawjaw

34
Emerging
50 IBMStreams/streamsx.nlp

Provide operations for text analysis, like lemmatization and text annotation...

34
Emerging
51 whelk-io/flesch-kincaid

Java OSS library for performing Flesch-Kincaid readability tests.

34
Emerging
52 apache/incubator-nlpcraft-java-client

Native Java Client for Apache NLPCraft.

34
Emerging
53 simongray/clojure-dsl-resources

A curated list of Clojure resources for dealing with domain-specific languages.

34
Emerging
54 caowm/JNI-HanLP

通过Delphi JNI封装的HanLP(汉语言处理)库

34
Emerging
55 marcusklang/docria

Semi-structured Document Model (Next-generation)

34
Emerging
56 oyahiroki/nlp4j

Natural Language Processing library for Java

33
Emerging
57 londogard/londogard-nlp-toolkit

Londogard Natural Language Processing Toolkit written in Kotlin

33
Emerging
58 soumyadip007/Natural-Language-Processing-Using-Java

Natural language processing is a subfield of linguistics, computer science,...

33
Emerging
59 simongray/datalinguist

Stanford CoreNLP in idiomatic Clojure.

33
Emerging
60 hgvgfgvh/JavaAiLocalKit

This library aims to provide Java developers with an easy-to-use,...

32
Emerging
61 open-hopin/mbFXWords

divide plain text in subject, predicate, object with OpenNLP

32
Emerging
62 rbehzadan/opennlp-service

RESTful API for Apache OpenNLP Library

32
Emerging
63 kon-drees/Fachpraktikum_K01589

Explanation component for a practical course "Natural Language Processing,...

31
Emerging
64 SemanticSoftwareLab/TextMining-LODeXporter

LODeXporter is a GATE component for transforming NLP annotations into RDF triples

31
Emerging
65 manzurola/errant4j

An unofficial Java port of ERRANT, the parallel text grammatical error annotator

31
Emerging
66 pepperkit/corenlp-stop-words-annotator

Stop words annotator for Stanford's CoreNLP library.

31
Emerging
67 6thsolution/ApexNLP

A natural language event parser for java and android.

31
Emerging
68 lexigram/example-code

Example code in multiple languages for interacting with Lexigram APIs

31
Emerging
69 dpom/clj-duckling

Language, engine, and tooling for expressing, testing, and evaluating...

31
Emerging
70 CogComp/jwnl-prime

Modified version of the JWNL (http://sourceforge.net/projects/jwordnet/)

31
Emerging
71 abzif/opennlp-model-generator

OpenNLP model generator for various languages

30
Emerging
72 DennisDeSwart/php-stanford-corenlp-adapter

PHP adapter for Stanford CoreNLP

30
Emerging
73 DEK11/MoreNLP

Capabilities of StanfordNLP and OpenNLP on Spark

29
Experimental
74 HebrewNLP/HebrewNLP4J

Java Library and Examples for using the HebrewNLP API

29
Experimental
75 ispasic/FlexiTerm-Java

Repository for FlexiTerm: a software tool to automatically recognise...

29
Experimental
76 clecoq75/semantic-annotator

Semantic Annotator

29
Experimental
77 plandes/clj-nlp-parse

Natural Language Parsing and Feature Generation

29
Experimental
78 tspannhw/nifi-langdetect-processor

Apache NiFi + Apache Tika + OptimaizeLangDetector

29
Experimental
79 iss-lab/nifi-stanfordcorenlp-processor

Nifi Stanford CoreNLP Processor

28
Experimental
80 proycon/nederlab-pipeline

Linguistic enrichment pipeline for historical dutch, as used in the Nederlab project

28
Experimental
81 ottiram/MMAX2

Official repo of the MMAX2 annotation tool

27
Experimental
82 arpieb/corenlp

The CoreNLP package is a thin Elixir client for the Stanford CoreNLP Server.

27
Experimental
83 InAnYan/spacy_pat_match_dsl

A simple DSL for creating spaCy pattern matchers

27
Experimental
84 textocat/textokit-core

TextoKit - is a set of components for Natural Language Processing based on...

27
Experimental
85 EmnamoR/NLP-processing-using-NLTK-Stanford-Core-nlp

nlp processing ( pos-tag, parsing , ner , coref resolution) using NLTK Stanford nlp

26
Experimental
86 jimtyhurst/stanford-parser-util-java

Java utility classes to simplify use of the Stanford Parser for a specific...

26
Experimental
87 justhalf/java-nlp-toolkit

My personal Java NLP toolkit that serves as an interface to various existing...

26
Experimental
88 hjian42/SVO_Automation

Automatic SVO Extraction Tool for Social Science

25
Experimental
89 jacopofar/fleximatcher

Small text annotation library

25
Experimental
90 data-integrations/nlp

A collection of directives and plugins for Natural Language Processing

25
Experimental
91 krajeswaran/ArticleAnalyzer

English language article analyzer using some NLP techniques

24
Experimental
92 dbracewell/hermes

A Natural Language Processing framework for Java

24
Experimental
93 IE4OpenData/Octroy

Octroy Pipeline for Open Data in Quebec

23
Experimental
94 hianshithere/Natural-Language-Processing

Realted Natural langage Processing Codes || Sentiment Analysis ||...

23
Experimental
95 bean5/nlp-paraphrase-detector

I built this to automate discovery of common text between two documents. I...

23
Experimental
96 jiangnanboy/j4nlp

java for nlp,java自然语言处理

23
Experimental
97 ARUKA-2017/processing-engine

An Open Source Maven Project for extracting valuable data and creating...

22
Experimental
98 plandes/clj-example-nlp-ml

Example Project for Natural Language Processing and Machine Learning Libraries

21
Experimental
99 bvolpato/mdmlang

🔄 Natural Transformation Language for Java

21
Experimental
100 davidecaroselli/jcld3

Java Native Interface for the popular Compact Language Detector v3 (CLD3) by Google

20
Experimental
101 CogComp/CogCompTime

CogCompTime

20
Experimental
102 encryptorcode/pluralize

Pluralize and singularize any word.

20
Experimental
103 CatalystCode/ClausIE-Server

Server for the ClausIE OpenIE parser

20
Experimental
104 tokenmill/dictionary-annotator

Fast and configurable UIMA dictionary annotator.

20
Experimental
105 ispasic/FlexiTermCymraeg

Unsupervised multi–word term recognition in Welsh

19
Experimental
106 igr/parlo

Answering questions using NLP in Java, made in 5 days

19
Experimental
107 rkaissi/java-autocomplete

Java autocomplete suggestion engine using a custom NLP implementation....

19
Experimental
108 kariminf/nalanpar

Natural language Parser

19
Experimental
109 mcre/aiwolf-gat2017-nlp

nlp

19
Experimental
110 Unique-Digital-Resources/Rule-Based-Natural-Language-Meaning-Processing

A rule-based tool for processing and extracting meanings from natural...

19
Experimental
111 kariminf/faris

Knowledge representation, Sentences, Language modeling, MultiLingual,

19
Experimental
112 mikahama/uralicNLP-Java

Multilingual Natural Language Processing for Java

19
Experimental
113 pkourdis/gateplugin-SUTime

GATE plugin to annotate documents with TIMEX3 tags using the SUTime library.

18
Experimental
114 kariminf/sentrep

This is a tool for different sentence representations like json, xml, etc.

17
Experimental
115 umanlp/wcl-wrapper

A simple wrapper around the Word-Class Lattices Java API.

17
Experimental
116 kariminf/nalangen

NaLanGen stands for "Natural Language Generator". The aim of this tool is to...

17
Experimental
117 kariminf/langpi

Language processing interface: some tools to process different natural languages

17
Experimental
118 seanghay/web-crfsuite

A CRFSuite port for Node, Browser & Deno

17
Experimental
119 kyuna0312/Natural_Language_Java

NLP Java Stanford data

16
Experimental
120 simon-zhiyuan-sun/text-forensics-toolkit

Clojure-based text-forensics toolkit: NLP profiling, psycholinguistic...

15
Experimental
121 ProjectPersephone/WebL0

front and back end for tools using Natural Semantic Metalanguage,...

13
Experimental
122 marcusklang/langforia

Language pipelines

12
Experimental
123 its-leo/Wikisource-Book-Analyzer

This tool fetches books from en.wikisource.org and analyzes them via Natural...

12
Experimental
124 ziyaddinovchiyev/stanford-nlp-custom-annotator

Custom Emotion Annotator for Stanford CoreNLP Java

11
Experimental
125 xszhong/tomn

The Java source code and datasets of our WWW2018 paper: Time Expression...

11
Experimental
126 pauldiac/QuoVadis

QuoVadis: annotation of Entities and Relations, initial Ph.D. work

11
Experimental
127 pcerman/etaoin-shrdlu

This is my attempt to make the SHRDLU program executable.

11
Experimental
128 TalMizrahii/Hearst-Patterns

The project is divided into two parts, each of which focuses on detecting...

11
Experimental
129 scientist-labs/phrasekit

Weak supervision for NER: mine domain-specific phrases from unlabeled...

11
Experimental
130 data-sandbox/nlp-brewer-finder

App to find and filter nearby breweries based on offerings

11
Experimental
131 SemanticWebBuilder/SWBNLP

Natural Language Processing utilities for SemanticWebBuilder

11
Experimental
132 crim-ca/PacteClient

Java client to access PACTE plaform

11
Experimental
133 elitcloud/elit-java

ELIT SDK for Java.

11
Experimental
134 nilswende/JLanI

Java language identification tool

11
Experimental
135 brainrex-intel/brainrex-java

Java client for the Brainrex API.

10
Experimental
136 mr-m0nkey/NLP

NLP Beginner project

10
Experimental
137 jzonthemtn/opennlp-benchmarks

A collection of Apache OpenNLP model training examples with performance data.

10
Experimental