All NLP Tools
13,598 tools ranked by quality score · Page 2 of 136
| # | Tool | Score | Tier |
|---|---|---|---|
| 101 |
jerryji1993/DNABERT
DNABERT: pre-trained Bidirectional Encoder Representations from Transformers... |
|
Established |
| 102 |
microsoft/Recognizers-Text
Microsoft.Recognizers.Text provides recognition and resolution of numbers,... |
|
Established |
| 103 |
stanfordnlp/CoreNLP
CoreNLP: A Java suite of core NLP tools for tokenization, sentence... |
|
Established |
| 104 |
zjunlp/DeepKE
[EMNLP 2022] An Open Toolkit for Knowledge Graph Extraction and Construction |
|
Established |
| 105 |
msgi/nlp-journey
Documents, papers and codes related to Natural Language Processing,... |
|
Established |
| 106 |
shibing624/similarity
similarity: Text similarity calculation Toolkit for Java.... |
|
Established |
| 107 |
philenius/ngx-annotate-text
This Angular component library is perfect for tasks like visualizing named... |
|
Established |
| 108 |
hb20007/hands-on-nltk-tutorial
The hands-on NLTK tutorial for NLP in Python |
|
Established |
| 109 |
adrien2p/nestjs-dialogflow
Dialog flow module that simplify the web hook handling for your NLP... |
|
Established |
| 110 |
eikek/docspell
Assist in organizing your piles of documents, resulting from scanners,... |
|
Established |
| 111 |
davidjurgens/potato
potato: the portable annotation tool |
|
Established |
| 112 |
Droidtown/ArticutAPI
API of Articut 中文斷詞 (兼具語意詞性標記):「斷詞」又稱「分詞」,是中文資訊處理的基礎。Articut... |
|
Established |
| 113 |
MLNLP-World/SimBiber
MLNLP社区用来帮助缩短参考文献的工具。A tool for simplifying bibtex with official info |
|
Established |
| 114 |
UCREL/pymusas
Python Multilingual Ucrel Semantic Analysis System |
|
Established |
| 115 |
goru001/inltk
Natural Language Toolkit for Indic Languages aims to provide out of the box... |
|
Established |
| 116 |
yongzhuo/nlp_xiaojiang
自然语言处理(nlp),小姜机器人(闲聊检索式chatbot),BERT句向量-相似度(Sentence... |
|
Established |
| 117 |
smilelight/lightNLP
基于Pytorch和torchtext的自然语言处理深度学习框架。 |
|
Established |
| 118 |
FerreroJeremy/ln2sql
A tool to query a database in natural language |
|
Established |
| 119 |
andrewtavis/kwx
BERT, LDA, and TFIDF based keyword extraction in Python |
|
Established |
| 120 |
wi2trier/cbrkit
Customizable Case-Based Reasoning (CBR) toolkit for Python with a built-in... |
|
Established |
| 121 |
smart-on-fhir/cumulus-etl
Extract FHIR data, Transform with NLP and DEID tools, and then Load FHIR... |
|
Established |
| 122 |
zaemyung/sentsplit
A flexible sentence segmentation library using CRF model and regex rules |
|
Established |
| 123 |
666ghj/BettaFish
微舆:人人可用的多Agent舆情分析助手,打破信息茧房,还原舆情原貌,预测未来走向,辅助决策!从0实现,不依赖任何框架。 |
|
Established |
| 124 |
yongzhuo/Keras-TextClassification
中文长文本分类、短句子分类、多标签分类、两句子相似度(Chinese Text Classification of Keras NLP,... |
|
Established |
| 125 |
OmkarPathak/pyresparser
A simple resume parser used for extracting information from resumes |
|
Established |
| 126 |
alvations/pywsd
Python Implementations of Word Sense Disambiguation (WSD) Technologies. |
|
Established |
| 127 |
chartbeat-labs/textacy
NLP, before and after spaCy |
|
Established |
| 128 |
rkcosmos/deepcut
A Thai word tokenization library using Deep Neural Network |
|
Established |
| 129 |
taishi-i/nagisa
A Japanese tokenizer based on recurrent neural networks |
|
Established |
| 130 |
hankcs/pyhanlp
中文分词 |
|
Established |
| 131 |
ssciwr/AMMICO
AI-based Media and Misinformation Content Analysis Tool: Analyze text and images |
|
Established |
| 132 |
smilelight/lightKG
基于Pytorch和torchtext的知识图谱深度学习框架。 |
|
Established |
| 133 |
codertimo/BERT-pytorch
Google AI 2018 BERT pytorch implementation |
|
Established |
| 134 |
natasha/ipymarkup
NER, syntax markup visualizations |
|
Established |
| 135 |
pemistahl/lingua-rs
The most accurate natural language detection library for Rust, suitable for... |
|
Established |
| 136 |
alirezatheh/perke
A keyphrase extractor for Persian |
|
Established |
| 137 |
Hironsan/anago
Bidirectional LSTM-CRF and ELMo for Named-Entity Recognition, Part-of-Speech... |
|
Established |
| 138 |
hyunwoongko/kss
KSS: Korean String processing Suite |
|
Established |
| 139 |
wayfair-incubator/extra-model
Code to run the ExtRA algorithm for unsupervised topic/aspect extraction on... |
|
Established |
| 140 |
davidsbatista/Snowball
Implementation with some extensions of the paper "Snowball: Extracting... |
|
Established |
| 141 |
medspacy/medspacy
Library for clinical NLP with spaCy. |
|
Established |
| 142 |
microsoft/presidio-research
This package features data-science related tasks for developing new... |
|
Established |
| 143 |
sagorbrur/bnlp
BNLP is a natural language processing toolkit for Bengali Language. |
|
Established |
| 144 |
mihail911/fake-news
Building a fake news detector from initial ideation to model deployment |
|
Established |
| 145 |
HLasse/TextDescriptives
A Python library for calculating a large variety of metrics from text |
|
Established |
| 146 |
raghakot/keras-text
Text Classification Library in Keras |
|
Established |
| 147 |
ownthink/Jiagu
Jiagu深度学习自然语言处理工具 知识图谱关系抽取 中文分词 词性标注 命名实体识别 情感分析 新词发现 关键词 文本摘要 文本聚类 |
|
Established |
| 148 |
brightmart/nlp_chinese_corpus
大规模中文自然语言处理语料 Large Scale Chinese Corpus for NLP |
|
Established |
| 149 |
baidu/Senta
Baidu's open-source Sentiment Analysis System. |
|
Established |
| 150 |
keon/awesome-nlp
:book: A curated list of resources dedicated to Natural Language Processing (NLP) |
|
Established |
| 151 |
ClipsAI/clipsai
Clips AI is an open-source Python library that automatically converts long... |
|
Established |
| 152 |
rodrigopivi/Chatito
🎯🗯 Dataset generation for AI chatbots, NLP tasks, named entity recognition... |
|
Established |
| 153 |
JayYip/m3tl
BERT for Multitask Learning |
|
Established |
| 154 |
jboynyc/textnets
Text analysis with networks. |
|
Established |
| 155 |
Georgetown-IR-Lab/QuickUMLS
System for Medical Concept Extraction and Linking |
|
Established |
| 156 |
ufal/factgenie
Lightweight self-hosted span annotation tool |
|
Established |
| 157 |
stanfordnlp/python-stanford-corenlp
Python interface to CoreNLP using a bidirectional server-client interface. |
|
Established |
| 158 |
nert-nlp/streusle
STREUSLE: a corpus with comprehensive lexical semantic annotation (multiword... |
|
Established |
| 159 |
cannlytics/cannlytics
🔥 Cannlytics = cannabis + analytics. Data pipelines, user interfaces, and... |
|
Established |
| 160 |
dkpro/dkpro-core
Collection of software components for natural language processing (NLP)... |
|
Established |
| 161 |
grid-parity-exchange/Egret
Tools for building power systems optimization problems |
|
Established |
| 162 |
shineware/KOMORAN
Korean Morphological Analyzer by shineware |
|
Established |
| 163 |
juliasilge/tidytext
Text mining using tidy tools :sparkles::page_facing_up::sparkles: |
|
Established |
| 164 |
hltcoe/turkle
Django-based clone of Amazon's Mechanical Turk service running in your local... |
|
Established |
| 165 |
taishi-i/awesome-japanese-nlp-resources
A curated list of resources dedicated to Python libraries, LLMs,... |
|
Established |
| 166 |
BrikerMan/Kashgari
Kashgari is a production-level NLP Transfer learning framework built on top... |
|
Established |
| 167 |
mcs07/ChemDataExtractor
Automatically extract chemical information from scientific documents |
|
Established |
| 168 |
huggingface/neuralcoref
✨Fast Coreference Resolution in spaCy with Neural Networks |
|
Established |
| 169 |
lonePatient/awesome-pretrained-chinese-nlp-models
Awesome Pretrained Chinese NLP Models,高质量中文预训练模型&大模型&多模态模型&大语言模型集合 |
|
Established |
| 170 |
stair-lab/kg-gen
[NeurIPS '25] Knowledge Graph Generation from Any Text |
|
Established |
| 171 |
nlpbook/nlpbook
Applied Natural Language Processing in the Enterprise - An O'Reilly Media Publication |
|
Established |
| 172 |
thunlp/OpenAttack
An Open-Source Package for Textual Adversarial Attack. |
|
Established |
| 173 |
920232796/bert_seq2seq
pytorch实现 Bert... |
|
Established |
| 174 |
n-waves/multifit
The code to reproduce results from paper "MultiFiT: Efficient Multi-lingual... |
|
Established |
| 175 |
fossology/safaa
Agent to compliment FOSSology's copyright scanner and find false positive findings. |
|
Established |
| 176 |
go-ego/gse
Go efficient multilingual NLP and text segmentation; support English,... |
|
Established |
| 177 |
baidu/lac
百度NLP:分词,词性标注,命名实体识别,词重要性 |
|
Established |
| 178 |
nlp-uoregon/trankit
Trankit is a Light-Weight Transformer-based Python Toolkit for Multilingual... |
|
Established |
| 179 |
apache/opennlp-sandbox
Apache OpenNLP Sandbox |
|
Established |
| 180 |
princeton-nlp/SimCSE
[EMNLP 2021] SimCSE: Simple Contrastive Learning of Sentence Embeddings... |
|
Established |
| 181 |
tanaos/artifex
Small Language Model Inference, Fine-Tuning and Observability. No GPU, no... |
|
Established |
| 182 |
fidelity/textwiser
[AAAI 2021] TextWiser: Text Featurization Library |
|
Established |
| 183 |
asyml/texar
Toolkit for Machine Learning, Natural Language Processing, and Text... |
|
Established |
| 184 |
thisandagain/sentiment
AFINN-based sentiment analysis for Node.js. |
|
Established |
| 185 |
vunb/vntk
Vietnamese NLP Toolkit for Node |
|
Established |
| 186 |
alphanome-ai/sec-parser
Parse SEC EDGAR HTML documents into a tree of elements that correspond to... |
|
Established |
| 187 |
kk7nc/HDLTex
HDLTex: Hierarchical Deep Learning for Text Classification |
|
Established |
| 188 |
charles9n/bert-sklearn
a sklearn wrapper for Google's BERT model |
|
Established |
| 189 |
dice-group/gerbil
GERBIL - General Entity annotatoR Benchmark |
|
Established |
| 190 |
cbaziotis/ekphrasis
Ekphrasis is a text processing tool, geared towards text from social... |
|
Established |
| 191 |
textvec/textvec
Text vectorization tool to outperform TFIDF for classification tasks |
|
Established |
| 192 |
AnasAito/SkillNER
A (smart) rule based NLP module to extract job skills from text |
|
Established |
| 193 |
PaddlePaddle/RocketQA
🚀 RocketQA, dense retrieval for information retrieval and question... |
|
Established |
| 194 |
blmoistawinde/HarvestText
文本挖掘和预处理工具(文本清洗、新词发现、情感分析、实体识别链接、关键词抽取、知识抽取、句法分析等),无监督或弱监督方法 |
|
Established |
| 195 |
messense/jieba-rs
The Jieba Chinese Word Segmentation Implemented in Rust |
|
Established |
| 196 |
bakwc/JamSpell
Modern spell checking library - accurate, fast, multi-language |
|
Established |
| 197 |
yongzhuo/Macropodus
自然语言处理工具Macropodus,基于Albert+BiLSTM+CRF深度学习网络架构,中文分词,词性标注,命名实体识别,新词发现,关键词,文本摘要... |
|
Established |
| 198 |
neuspell/neuspell
NeuSpell: A Neural Spelling Correction Toolkit |
|
Established |
| 199 |
isaacus-dev/isaacus-python
A Python library for interacting with the Isaacus API. |
|
Established |
| 200 |
yohasebe/wp2txt
A command-line tool to extract plain text from Wikipedia dumps with category... |
|
Established |