General NLP Toolkits (NLP Tools)

Comprehensive NLP libraries and toolkits that bundle multiple fundamental NLP tasks (tokenization, stemming, POS tagging, sentiment analysis, NER) into reusable modules. The category does NOT include single-task tools, domain-specific NLP applications, or language-specific implementations.

85 general NLP toolkits are tracked. 3 score 70 or above (Verified tier). The highest-rated is chrismattmann/tika-python at 81/100 with 1,651 stars. 2 of the top 10 are actively maintained.

Get all 85 projects as JSON

curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=nlp&subcategory=general-nlp-toolkits&limit=20"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
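
For use from a script rather than the shell, the same request can be sketched in Python with only the standard library. The endpoint and query parameters are copied from the curl command above; the helper names `build_url` and `fetch_quality` are hypothetical, and the shape of the returned JSON is not documented here, so the sketch only decodes it. The command as written asks for `limit=20`; whether a larger value such as 85 is accepted is an assumption to check against the API.

```python
import json
import urllib.parse
import urllib.request

# Endpoint taken from the curl command above.
BASE_URL = "https://pt-edge.onrender.com/api/v1/datasets/quality"


def build_url(domain="nlp", subcategory="general-nlp-toolkits", limit=20):
    """Build the same query URL as the curl command (hypothetical helper)."""
    query = urllib.parse.urlencode(
        {"domain": domain, "subcategory": subcategory, "limit": limit}
    )
    return f"{BASE_URL}?{query}"


def fetch_quality(url, timeout=30):
    """GET the URL and decode the body as JSON.

    The response schema is an unknown here, so the decoded value is
    returned unchanged instead of being indexed by assumed keys.
    """
    with urllib.request.urlopen(url, timeout=timeout) as response:
        return json.load(response)


# Example (performs a network request, so it is left commented out):
# data = fetch_quality(build_url())
# print(json.dumps(data, indent=2)[:500])
```

Keeping URL construction separate from the network call makes it easy to confirm the request matches the documented one before spending any of the 100 keyless requests per day.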

| # | Tool | Description | Score | Tier |
|---|------|-------------|-------|------|
| 1 | chrismattmann/tika-python | Tika-Python is a Python binding to the Apache Tika™ REST services allowing... | 81 | Verified |
| 2 | sloria/TextBlob | Simple, Pythonic, text processing--Sentiment analysis, part-of-speech... | 81 | Verified |
| 3 | cltk/cltk | The Classical Language Toolkit | 70 | Verified |
| 4 | allenai/scispacy | A full spaCy pipeline and models for scientific/biomedical documents. | 64 | Established |
| 5 | wi2trier/cbrkit | Customizable Case-Based Reasoning (CBR) toolkit for Python with a built-in... | 60 | Established |
| 6 | grid-parity-exchange/Egret | Tools for building power systems optimization problems | 59 | Established |
| 7 | CGCL-codes/naturalcc | NaturalCC: An Open-Source Toolkit for Code Intelligence | 57 | Established |
| 8 | delph-in/pydelphin | Python libraries for DELPH-IN | 56 | Established |
| 9 | SamEdwardes/spacytextblob | A TextBlob sentiment analysis pipeline component for spaCy. | 54 | Established |
| 10 | ysenarath/sinling | A collection of NLP tools for Sinhalese (සිංහල). | 52 | Established |
| 11 | rajcscw/nlp-gym | NLPGym - A toolkit to develop RL agents to solve NLP tasks. | 49 | Emerging |
| 12 | sharejing/Takin | A Python toolkit for file processing, text cleaning and data splitting.... | 48 | Emerging |
| 13 | lyeoni/prenlp | Preprocessing Library for Natural Language Processing | 47 | Emerging |
| 14 | StarlangSoftware/NlpToolkit | Natural Language Processing Library | 47 | Emerging |
| 15 | NIHOPA/NLPre | Python library for Natural Language Preprocessing (NLPre) | 47 | Emerging |
| 16 | soskek/bookcorpus | Crawl BookCorpus | 47 | Emerging |
| 17 | neocl/speach | 🐍🍑 Python 3 library for managing, annotating, and converting natural... | 47 | Emerging |
| 18 | rlayers/pawpaw | Text Processing & Segmentation Framework | 46 | Emerging |
| 19 | DukeEnglish/knlp | A very handy toolkit that can be installed and used directly | 46 | Emerging |
| 20 | quadrismegistus/prosodic | Prosodic: a metrical-phonological parser, written in Python. For English and... | 46 | Emerging |
| 21 | sliderSun/pynlp | NLP related tasks, including text classification, sequence annotation, text... | 45 | Emerging |
| 22 | alexandrainst/danlp | DaNLP is a repository for Natural Language Processing resources for the... | 45 | Emerging |
| 23 | dhopp1/nlp_pipeline | Collection of NLP tools for processing and analyzing text data. | 43 | Emerging |
| 24 | stevewyl/nlp_toolkit | Basic NLP Toolkits include text classification, sequence labeling etc. | 42 | Emerging |
| 25 | Vidito/vidinlp | VidiNLP is a simple, modern, and fast NLP library built on top of spaCy. It... | 41 | Emerging |
| 26 | d99kris/spacy-cpp | C++ wrapper library for the NLP library spaCy | 40 | Emerging |
| 27 | ckiplab/ckipnlp | CKIP CoreNLP Toolkits | 40 | Emerging |
| 28 | wyounas/homer | Homer, a text analyser in Python, can help make your text more clear, simple... | 40 | Emerging |
| 29 | lvzii/nlpertools | Functions commonly used by NLP/data practitioners that simplify read/write operations and make code more readable... | 40 | Emerging |
| 30 | microsoft/verseagility | Ramp up your custom natural language processing (NLP) task, allowing you to... | 40 | Emerging |
| 31 | DerXter/State-of-NLP-Research-in-Senegal | First comprehensive survey of NLP work carried out in Senegalese languages... | 40 | Emerging |
| 32 | paudan/opennlp_python | Python NLTK module for interfacing with the Apache OpenNLP | 39 | Emerging |
| 33 | lattice-8094/propp | PROPP: A Python library for narrative analysis | 39 | Emerging |
| 34 | asyml/forte-wrappers | Forte wrapper of third-party toolkits. | 38 | Emerging |
| 35 | worldbank/wb-nlp-tools | Natural language processing tools developed by the World Bank's DECAT unit.... | 37 | Emerging |
| 36 | lord-alfred/dnlp | 📚 A collection of useful things from Natural Language Processing: language detection... | 37 | Emerging |
| 37 | NLP-Suite/NLP-Suite | CLICK ON WIKI LINK BELOW OR ON Wiki TAB AT TOP BANNER FOR DOWNLOAD... | 37 | Emerging |
| 38 | dlite-tools/NLPiper | NLPiper is a package that agglomerates different NLP tools and applies their... | 36 | Emerging |
| 39 | pharo-ai/Polyglot | A library for Natural Language Processing | 34 | Emerging |
| 40 | wjbmattingly/bagpipes-spacy | Bagpipes spaCy is a collection of custom spaCy pipeline components designed... | 34 | Emerging |
| 41 | entelecheia/ekorpkit | ekorpkit: NLP Library for Social Science Research | 33 | Emerging |
| 42 | AdrianBZG/Polyglotter | [Nature Scientific Reports] Translating synthetic natural language to... | 32 | Emerging |
| 43 | dcondrey/BridgeNLP | A universal NLP model-to-pipeline integration framework. | 32 | Emerging |
| 44 | markgw/pimlico | The Pimlico Processing Toolkit | 30 | Emerging |
| 45 | F1uctus/ttc | ✍ 🗣 A Text-To-Conversation natural language processing toolkit [WIP]. | 30 | Emerging |
| 46 | hulsedev/hulse-py | The Python client for the Hulse platform | 30 | Emerging |
| 47 | olcaytaner/NlpToolkit | Natural Language Processing Library | 30 | Emerging |
| 48 | capjamesg/pysurprisal | Calculate surprisal for words in text. | 29 | Experimental |
| 49 | TakeLab/podium | Podium: a framework agnostic Python NLP library for data loading and preprocessing | 29 | Experimental |
| 50 | davidjurgens/sporc | Python library for working with the Structured Podcast Open Research Corpus (SPORC) | 29 | Experimental |
| 51 | moraleida/wordpress-nlp-tools | A toolkit for using self-hosted Natural Language Processing with... | 28 | Experimental |
| 52 | aditeyabaral/nlpc | Natural Language Toolkit built using the C Programming Language | 28 | Experimental |
| 53 | wencan/distilNLP | distilNLP is a natural language processing toolkit that is out-of-the-box,... | 28 | Experimental |
| 54 | tinq-ai/tinq-python | Tinq.ai python library | 27 | Experimental |
| 55 | peterdalle/svensktext | Swedish language resources: female and male names, localities, counties, municipalities, countries,... | 27 | Experimental |
| 56 | hitchhicker/tweet_nlp_toolkit | Tweet NLP toolkit | 27 | Experimental |
| 57 | liuziwei7/nlp-toolkit | NLP Toolkit for Social Science (NTSS) | 26 | Experimental |
| 58 | Valuebai/pyqt5_nlp_tools | PyQt5-integrated NLP utilities (text merging, word segmentation, POS tagging); can also serve as an introductory PyQt5 example | 25 | Experimental |
| 59 | erickorsi/polyglot-windows-installation | Installation and setup of Polyglot package for Python 3 on Windows. | 23 | Experimental |
| 60 | eistakovskii/NLP_Toolbox | This repository contains various tools intended for handling Natural... | 23 | Experimental |
| 61 | ashishpoudel995/NLP-for-Nepali-Language | The project is all about Natural Language Processing for the Nepali... | 23 | Experimental |
| 62 | goldwing4/pyml | 🐍 Write Python code using a YAML-like syntax for cleaner and more readable... | 22 | Experimental |
| 63 | ojrlopez27/nl-service-composition | NLSC Unrestricted Natural Language-based Service Composition Middleware that... | 21 | Experimental |
| 64 | MayADevBe/Swedish-FrequencyList-8Sidor | Data Mining - Swedish Frequency List from Sidor8 | 20 | Experimental |
| 65 | fbn776/Word-groups | An effort to organize more than 370k words in the English language. | 20 | Experimental |
| 66 | galax19ksh/Manipuri-NLP | A comprehensive overview of research regarding Natural Language Processing... | 20 | Experimental |
| 67 | andrea-ballatore/place-vocabulary | This repository contains a vocabulary of nouns used to describe places, such... | 20 | Experimental |
| 68 | KhiemNguyen2104/CO3085-Chunking-in-NLP | A repository for my NLP project that focuses on chunking techniques in NLP for... | 19 | Experimental |
| 69 | Dechrissen/hstk | Toolkit for creating and interfacing with a database of news headlines | 19 | Experimental |
| 70 | partigabor/scikit-talk | A toolkit to process real-world conversational speech data (moved) | 19 | Experimental |
| 71 | Dellonath/nlp-cognitive-engine | Development of an NLP Cognitive Engine using the KERAS Deep Learning... | 18 | Experimental |
| 72 | siara-cc/Text_frequency_research | Developing language frequency lists from Reddit/Twitter posts and Wikipedia articles | 18 | Experimental |
| 73 | Alex-bzh/L2TAL | Resources for L2 TAL at Inalco | 18 | Experimental |
| 74 | TohidN/ElevateNLP | ElevateNLP is a web based NLP Platform for creation, organization,... | 17 | Experimental |
| 75 | JJWRoeloffs/DynamicFluency | DynamicFluency - Monitor and understand the dynamicity of linguistic aspects... | 17 | Experimental |
| 76 | PujanMotiwala/the_nlp_toolkit | Your comprehensive resource for mastering NLP techniques. Includes text... | 17 | Experimental |
| 77 | Thableaddles57/nlp-toolkit | A Python-based natural language processing toolkit with common NLP tasks... | 14 | Experimental |
| 78 | Riccorl/ipa | NLP Preprocessing Pipeline Wrappers | 13 | Experimental |
| 79 | rezansrv/Natural-Language-Processing-NLP | Multilingual NLP Text Processing in Python | 12 | Experimental |
| 80 | TRomesh/sinhala-parser | Sinhala parser project is based on Natural Language Processing (NLP) | 11 | Experimental |
| 81 | harrycslau/finnish-vocab-learning-nlp | A small NLP project enabling Finnish learners to learn new vocabulary. | 11 | Experimental |
| 82 | maxslimb/NLP-Chunking | Text chunking using NLP | 11 | Experimental |
| 83 | imadulhassan/WordsCourpus-Using-NLTK | This Python script is used to extract words from WordNet using Python Natural... | 10 | Experimental |
| 84 | entropy2333/nlp-utils | easy-to-use nlp utils library | 10 | Experimental |
| 85 | puya-pro/nlp-en_model | This is an easy NLP model for beginners. This model is just for English lang .... | 10 | Experimental |
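
The page does not state where one tier ends and the next begins, apart from tying the Verified tier to a score of 70. The sketch below maps a score to a tier using cut-offs inferred from the listed rows: 70 is the lowest Verified score, 52 the lowest Established, 30 the lowest Emerging, and 29 the highest Experimental. The 50 and 30 boundaries and the function name `tier_for_score` are assumptions for illustration, not the site's published rules.

```python
def tier_for_score(score: int) -> str:
    """Map a 0-100 quality score to a tier name.

    Cut-offs are ASSUMED from the rows of the table: no listed score
    falls between 49 (Emerging) and 52 (Established), so 50 is a guess.
    """
    if score >= 70:
        return "Verified"
    if score >= 50:
        return "Established"
    if score >= 30:
        return "Emerging"
    return "Experimental"


# Spot-check against a few rows of the table.
for name, score in [
    ("cltk/cltk", 70),
    ("ysenarath/sinling", 52),
    ("olcaytaner/NlpToolkit", 30),
    ("capjamesg/pysurprisal", 29),
]:
    print(f"{name}: {score} -> {tier_for_score(score)}")
```

A check like this is mainly useful for filtering the JSON export by tier when only the numeric score is at hand.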