All NLP Tools

13,598 tools ranked by quality score · Page 2 of 136

Showing 101–200 of 13,598
# Tool Score Tier
101 jerryji1993/DNABERT

DNABERT: pre-trained Bidirectional Encoder Representations from Transformers...

61
Established
102 microsoft/Recognizers-Text

Microsoft.Recognizers.Text provides recognition and resolution of numbers,...

61
Established
103 stanfordnlp/CoreNLP

CoreNLP: A Java suite of core NLP tools for tokenization, sentence...

61
Established
104 zjunlp/DeepKE

[EMNLP 2022] An Open Toolkit for Knowledge Graph Extraction and Construction

61
Established
105 msgi/nlp-journey

Documents, papers and codes related to Natural Language Processing,...

61
Established
106 shibing624/similarity

similarity: Text similarity calculation Toolkit for Java....

61
Established
107 philenius/ngx-annotate-text

This Angular component library is perfect for tasks like visualizing named...

61
Established
108 hb20007/hands-on-nltk-tutorial

The hands-on NLTK tutorial for NLP in Python

61
Established
109 adrien2p/nestjs-dialogflow

Dialog flow module that simplify the web hook handling for your NLP...

61
Established
110 eikek/docspell

Assist in organizing your piles of documents, resulting from scanners,...

61
Established
111 davidjurgens/potato

potato: the portable annotation tool

61
Established
112 Droidtown/ArticutAPI

API of Articut 中文斷詞 (兼具語意詞性標記):「斷詞」又稱「分詞」,是中文資訊處理的基礎。Articut...

61
Established
113 MLNLP-World/SimBiber

MLNLP社区用来帮助缩短参考文献的工具。A tool for simplifying bibtex with official info

60
Established
114 UCREL/pymusas

Python Multilingual Ucrel Semantic Analysis System

60
Established
115 goru001/inltk

Natural Language Toolkit for Indic Languages aims to provide out of the box...

60
Established
116 yongzhuo/nlp_xiaojiang

自然语言处理(nlp),小姜机器人(闲聊检索式chatbot),BERT句向量-相似度(Sentence...

60
Established
117 smilelight/lightNLP

基于Pytorch和torchtext的自然语言处理深度学习框架。

60
Established
118 FerreroJeremy/ln2sql

A tool to query a database in natural language

60
Established
119 andrewtavis/kwx

BERT, LDA, and TFIDF based keyword extraction in Python

60
Established
120 wi2trier/cbrkit

Customizable Case-Based Reasoning (CBR) toolkit for Python with a built-in...

60
Established
121 smart-on-fhir/cumulus-etl

Extract FHIR data, Transform with NLP and DEID tools, and then Load FHIR...

60
Established
122 zaemyung/sentsplit

A flexible sentence segmentation library using CRF model and regex rules

60
Established
123 666ghj/BettaFish

微舆:人人可用的多Agent舆情分析助手,打破信息茧房,还原舆情原貌,预测未来走向,辅助决策!从0实现,不依赖任何框架。

60
Established
124 yongzhuo/Keras-TextClassification

中文长文本分类、短句子分类、多标签分类、两句子相似度(Chinese Text Classification of Keras NLP,...

60
Established
125 OmkarPathak/pyresparser

A simple resume parser used for extracting information from resumes

60
Established
126 alvations/pywsd

Python Implementations of Word Sense Disambiguation (WSD) Technologies.

60
Established
127 chartbeat-labs/textacy

NLP, before and after spaCy

60
Established
128 rkcosmos/deepcut

A Thai word tokenization library using Deep Neural Network

60
Established
129 taishi-i/nagisa

A Japanese tokenizer based on recurrent neural networks

60
Established
130 hankcs/pyhanlp

中文分词

60
Established
131 ssciwr/AMMICO

AI-based Media and Misinformation Content Analysis Tool: Analyze text and images

60
Established
132 smilelight/lightKG

基于Pytorch和torchtext的知识图谱深度学习框架。

60
Established
133 codertimo/BERT-pytorch

Google AI 2018 BERT pytorch implementation

60
Established
134 natasha/ipymarkup

NER, syntax markup visualizations

60
Established
135 pemistahl/lingua-rs

The most accurate natural language detection library for Rust, suitable for...

60
Established
136 alirezatheh/perke

A keyphrase extractor for Persian

60
Established
137 Hironsan/anago

Bidirectional LSTM-CRF and ELMo for Named-Entity Recognition, Part-of-Speech...

60
Established
138 hyunwoongko/kss

KSS: Korean String processing Suite

60
Established
139 wayfair-incubator/extra-model

Code to run the ExtRA algorithm for unsupervised topic/aspect extraction on...

60
Established
140 davidsbatista/Snowball

Implementation with some extensions of the paper "Snowball: Extracting...

60
Established
141 medspacy/medspacy

Library for clinical NLP with spaCy.

60
Established
142 microsoft/presidio-research

This package features data-science related tasks for developing new...

59
Established
143 sagorbrur/bnlp

BNLP is a natural language processing toolkit for Bengali Language.

59
Established
144 mihail911/fake-news

Building a fake news detector from initial ideation to model deployment

59
Established
145 HLasse/TextDescriptives

A Python library for calculating a large variety of metrics from text

59
Established
146 raghakot/keras-text

Text Classification Library in Keras

59
Established
147 ownthink/Jiagu

Jiagu深度学习自然语言处理工具 知识图谱关系抽取 中文分词 词性标注 命名实体识别 情感分析 新词发现 关键词 文本摘要 文本聚类

59
Established
148 brightmart/nlp_chinese_corpus

大规模中文自然语言处理语料 Large Scale Chinese Corpus for NLP

59
Established
149 baidu/Senta

Baidu's open-source Sentiment Analysis System.

59
Established
150 keon/awesome-nlp

:book: A curated list of resources dedicated to Natural Language Processing (NLP)

59
Established
151 ClipsAI/clipsai

Clips AI is an open-source Python library that automatically converts long...

59
Established
152 rodrigopivi/Chatito

🎯🗯 Dataset generation for AI chatbots, NLP tasks, named entity recognition...

59
Established
153 JayYip/m3tl

BERT for Multitask Learning

59
Established
154 jboynyc/textnets

Text analysis with networks.

59
Established
155 Georgetown-IR-Lab/QuickUMLS

System for Medical Concept Extraction and Linking

59
Established
156 ufal/factgenie

Lightweight self-hosted span annotation tool

59
Established
157 stanfordnlp/python-stanford-corenlp

Python interface to CoreNLP using a bidirectional server-client interface.

59
Established
158 nert-nlp/streusle

STREUSLE: a corpus with comprehensive lexical semantic annotation (multiword...

59
Established
159 cannlytics/cannlytics

🔥 Cannlytics = cannabis + analytics. Data pipelines, user interfaces, and...

59
Established
160 dkpro/dkpro-core

Collection of software components for natural language processing (NLP)...

59
Established
161 grid-parity-exchange/Egret

Tools for building power systems optimization problems

59
Established
162 shineware/KOMORAN

Korean Morphological Analyzer by shineware

59
Established
163 juliasilge/tidytext

Text mining using tidy tools :sparkles::page_facing_up::sparkles:

59
Established
164 hltcoe/turkle

Django-based clone of Amazon's Mechanical Turk service running in your local...

59
Established
165 taishi-i/awesome-japanese-nlp-resources

A curated list of resources dedicated to Python libraries, LLMs,...

59
Established
166 BrikerMan/Kashgari

Kashgari is a production-level NLP Transfer learning framework built on top...

59
Established
167 mcs07/ChemDataExtractor

Automatically extract chemical information from scientific documents

59
Established
168 huggingface/neuralcoref

✨Fast Coreference Resolution in spaCy with Neural Networks

59
Established
169 lonePatient/awesome-pretrained-chinese-nlp-models

Awesome Pretrained Chinese NLP Models,高质量中文预训练模型&大模型&多模态模型&大语言模型集合

59
Established
170 stair-lab/kg-gen

[NeurIPS '25] Knowledge Graph Generation from Any Text

58
Established
171 nlpbook/nlpbook

Applied Natural Language Processing in the Enterprise - An O'Reilly Media Publication

58
Established
172 thunlp/OpenAttack

An Open-Source Package for Textual Adversarial Attack.

58
Established
173 920232796/bert_seq2seq

pytorch实现 Bert...

58
Established
174 n-waves/multifit

The code to reproduce results from paper "MultiFiT: Efficient Multi-lingual...

58
Established
175 fossology/safaa

Agent to compliment FOSSology's copyright scanner and find false positive findings.

58
Established
176 go-ego/gse

Go efficient multilingual NLP and text segmentation; support English,...

58
Established
177 baidu/lac

百度NLP:分词,词性标注,命名实体识别,词重要性

58
Established
178 nlp-uoregon/trankit

Trankit is a Light-Weight Transformer-based Python Toolkit for Multilingual...

58
Established
179 apache/opennlp-sandbox

Apache OpenNLP Sandbox

58
Established
180 princeton-nlp/SimCSE

[EMNLP 2021] SimCSE: Simple Contrastive Learning of Sentence Embeddings...

58
Established
181 tanaos/artifex

Small Language Model Inference, Fine-Tuning and Observability. No GPU, no...

58
Established
182 fidelity/textwiser

[AAAI 2021] TextWiser: Text Featurization Library

58
Established
183 asyml/texar

Toolkit for Machine Learning, Natural Language Processing, and Text...

58
Established
184 thisandagain/sentiment

AFINN-based sentiment analysis for Node.js.

58
Established
185 vunb/vntk

Vietnamese NLP Toolkit for Node

58
Established
186 alphanome-ai/sec-parser

Parse SEC EDGAR HTML documents into a tree of elements that correspond to...

58
Established
187 kk7nc/HDLTex

HDLTex: Hierarchical Deep Learning for Text Classification

58
Established
188 charles9n/bert-sklearn

a sklearn wrapper for Google's BERT model

58
Established
189 dice-group/gerbil

GERBIL - General Entity annotatoR Benchmark

58
Established
190 cbaziotis/ekphrasis

Ekphrasis is a text processing tool, geared towards text from social...

58
Established
191 textvec/textvec

Text vectorization tool to outperform TFIDF for classification tasks

58
Established
192 AnasAito/SkillNER

A (smart) rule based NLP module to extract job skills from text

58
Established
193 PaddlePaddle/RocketQA

🚀 RocketQA, dense retrieval for information retrieval and question...

58
Established
194 blmoistawinde/HarvestText

文本挖掘和预处理工具(文本清洗、新词发现、情感分析、实体识别链接、关键词抽取、知识抽取、句法分析等),无监督或弱监督方法

58
Established
195 messense/jieba-rs

The Jieba Chinese Word Segmentation Implemented in Rust

57
Established
196 bakwc/JamSpell

Modern spell checking library - accurate, fast, multi-language

57
Established
197 yongzhuo/Macropodus

自然语言处理工具Macropodus,基于Albert+BiLSTM+CRF深度学习网络架构,中文分词,词性标注,命名实体识别,新词发现,关键词,文本摘要...

57
Established
198 neuspell/neuspell

NeuSpell: A Neural Spelling Correction Toolkit

57
Established
199 isaacus-dev/isaacus-python

A Python library for interacting with the Isaacus API.

57
Established
200 yohasebe/wp2txt

A command-line tool to extract plain text from Wikipedia dumps with category...

57
Established