All NLP Tools
13,598 tools ranked by quality score · Page 3 of 136
| # | Tool | Score | Tier |
|---|---|---|---|
| 201 | isaacus-dev/isaacus-python<br>A Python library for interacting with the Isaacus API. | | Established |
| 202 | esbatmop/MNBVC<br>MNBVC(Massive Never-ending BT Vast Chinese... | | Established |
| 203 | yohasebe/wp2txt<br>A command-line tool to extract plain text from Wikipedia dumps with category... | | Established |
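| 204 | SeanLee97/xmnlp<br>xmnlp: provides Chinese word segmentation, part-of-speech tagging, named entity recognition, sentiment analysis, text correction, text-to-pinyin conversion, text summarization, radical extraction, sentence representation, and text similarity computation | | Established |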
| 205 | sciknoworg/OntoAligner<br>OntoAligner: A Python Toolkit for Ontology Alignment... | | Established |
| 206 | shibing624/dialogbot<br>dialogbot, provide search-based dialogue, task-based dialogue and generative... | | Established |
| 207 | mirkosertic/FXDesktopSearch<br>A JavaFX based desktop search application. | | Established |
| 208 | stephenhky/PyShortTextCategorization<br>Various Algorithms for Short Text Mining | | Established |
| 209 | sushil79g/Nepali_nlp<br>A python based library for NLP in Nepali language | | Established |
| 210 | explosion/spacy-streamlit<br>👑 spaCy building blocks and visualizers for Streamlit apps | | Established |
| 211 | jalajthanaki/NLPython<br>This repository contains the code related to Natural Language Processing... | | Established |
| 212 | paschmann/rasa-ui<br>Rasa UI is a frontend for the Rasa Framework | | Established |
| 213 | KennethEnevoldsen/asent<br>Asent is a python library for performing efficient and transparent sentiment... | | Established |
| 214 | houbb/sensitive-word<br>👮‍♂️ The sensitive word tool for Java. (Sensitive words / prohibited words / illegal words / profanity. A high-performance, DFA-based Java... | | Established |
| 215 | CGCL-codes/naturalcc<br>NaturalCC: An Open-Source Toolkit for Code Intelligence | | Established |
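| 216 | NLP-LOVE/Introduction-NLP<br>Detailed notes on "Introduction to Natural Language Processing", the new book by the author of HanLP! A conscientious work: rather than a dry list of formulas, the book explains algorithms and models in plain, accessible language. Starting from basic concepts, it gradually introduces... | | Established |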
| 217 | nltk/nltk_data<br>NLTK Data | | Established |
| 218 | ropensci/googleLanguageR<br>R client for the Google Translation API, Google Cloud Natural Language API... | | Established |
| 219 | chrislit/abydos<br>Abydos NLP/IR library for Python | | Established |
| 220 | daac-tools/vibrato<br>🎤 vibrato: Viterbi-based accelerated tokenizer | | Established |
| 221 | natasha/corus<br>Links to Russian corpora + Python functions for loading and parsing | | Established |
| 222 | fastnlp/fastNLP<br>fastNLP: A Modularized and Extensible NLP Framework. Currently still in incubation. | | Established |
| 223 | messense/jieba-rs<br>The Jieba Chinese Word Segmentation Implemented in Rust | | Established |
| 224 | natasha/razdel<br>Rule-based token, sentence segmentation for Russian language | | Established |
| 225 | bltlab/seqscore<br>SeqScore: Scoring for named entity recognition and other sequence labeling tasks | | Established |
| 226 | messense/fasttext-serving<br>fastText model serving service | | Established |
| 227 | bataak/dict-mn<br>Mongolian spellchecking dictionary | | Established |
| 228 | mideind/GreynirServer<br>The greynir.is Icelandic natural language processing API and website. | | Established |
| 229 | ines/spacy-js<br>🎀 JavaScript API for spaCy with Python REST API | | Established |
| 230 | PaddlePaddle/ERNIE<br>The official repository for ERNIE 4.5 and ERNIEKit – its industrial-grade... | | Established |
| 231 | snipsco/snips-nlu<br>Snips Python library to extract meaning from text | | Established |
| 232 | lovit/KR-WordRank<br>A library that automatically extracts words/keywords from Korean text using unsupervised learning | | Established |
| 233 | SimGus/Chatette<br>A powerful dataset generator for Rasa NLU, inspired by Chatito | | Established |
| 234 | EdinburghNLP/awesome-hallucination-detection<br>List of papers on hallucination detection in LLMs. | | Established |
| 235 | rudikershaw/whichx<br>A small, no dependencies, Naive Bayes Text Classifier for JavaScript | | Established |
| 236 | FraBle/python-sutime<br>Python wrapper for Stanford CoreNLP's SUTime | | Established |
| 237 | quadbio/cell-annotator<br>Automatically annotate cell types, consistently across samples. | | Established |
| 238 | thunlp/OpenHowNet<br>Core Data of HowNet and OpenHowNet Python API | | Established |
| 239 | PragatiVerma18/MLH-Quizzet<br>This is a smart Quiz Generator that generates a dynamic quiz from any... | | Established |
| 240 | interpretml/interpret-text<br>A library that incorporates state-of-the-art explainers for text-based... | | Established |
| 241 | staticdev/human-readable<br>Lib to make data intended for machines, readable to humans. | | Established |
| 242 | RocketChat/hubot-natural<br>Natural Language Processing Chatbot for RocketChat | | Established |
| 243 | delph-in/pydelphin<br>Python libraries for DELPH-IN | | Established |
| 244 | preligens-lab/textnoisr<br>Adding random noise to a text dataset, and controlling very accurately the... | | Established |
| 245 | centre-for-humanities-computing/DaCy<br>DaCy: The State of the Art Danish NLP pipeline using SpaCy | | Established |
| 246 | R1j1t/contextualSpellCheck<br>✔️Contextual word checker for better suggestions (not actively maintained) | | Established |
| 247 | sileod/tasknet<br>Easy modernBERT fine-tuning and multi-task learning | | Established |
| 248 | openfactcheck-research/openfactcheck<br>An Open-source Factuality Evaluation Demo for LLMs | | Established |
| 249 | yogeshhk/MiningResume<br>Text Mining certain fields from a resume | | Established |
| 250 | zjunlp/OpenUE<br>[EMNLP 2020] OpenUE: An Open Toolkit of Universal Extraction from Text | | Established |
| 251 | opencog/link-grammar<br>The CMU Link Grammar natural language parser | | Established |
| 252 | 425776024/nlpcda<br>One-click Chinese data augmentation package; NLP data augmentation, BERT data augmentation, EDA: pip install nlpcda | | Established |
| 253 | openeventdata/mordecai<br>Full text geoparsing as a Python library | | Established |
| 254 | fukuball/jieba-php<br>"Jieba" Chinese word segmentation: built to be the best PHP Chinese word segmentation component. / "Jieba" (Chinese for "to stutter") Chinese... | | Established |
| 255 | jidasheng/bi-lstm-crf<br>A PyTorch implementation of the BI-LSTM-CRF model. | | Established |
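| 256 | SWHL/AI-Competition-Collections<br>A collection of AI competition experience posts and training/testing tips posts (gathering and organizing experience write-ups from various AI competitions) | | Established |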
| 257 | brucewlee/lftk<br>[BEA @ ACL 2023] General-purpose tool for linguistic features extraction;... | | Established |
| 258 | huspacy/huspacy<br>HuSpaCy: industrial-strength Hungarian natural language processing | | Established |
| 259 | JohnSnowLabs/johnsnowlabs<br>Gateway into the John Snow Labs Ecosystem | | Established |
| 260 | Rostlab/nalaf<br>NLP framework in python for entity recognition and relationship extraction | | Established |
| 261 | averbis/averbis-python-api<br>Conveniently access the REST API of Averbis products using Python | | Established |
| 262 | proycon/pynlpl<br>PyNLPl, pronounced as 'pineapple', is a Python library for Natural Language... | | Established |
| 263 | Extralit/extralit<br>Fast and accurate systemic data extraction with LLM assistance | | Established |
| 264 | dselivanov/text2vec<br>Fast vectorization, topic modeling, distances and GloVe word embeddings in R. | | Established |
| 265 | emres/turkish-deasciifier<br>Turkish deasciifier in Python based on Deniz Yüret's turkish-mode for Emacs | | Established |
| 266 | bnosac/udpipe<br>R package for Tokenization, Parts of Speech Tagging, Lemmatization and... | | Established |
| 267 | OpenNMT/Tokenizer<br>Fast and customizable text tokenization library with BPE and SentencePiece support | | Established |
| 268 | shibing624/pytextclassifier<br>pytextclassifier is a toolkit for text classification.... | | Established |
| 269 | quickwit-oss/whichlang<br>A blazingly fast and lightweight language detection library for Rust | | Established |
| 270 | carlosplanchon/betterhtmlchunking<br>BetterHTMLChunking is a Python library for intelligent HTML segmentation. It... | | Established |
| 271 | apache/ctakes<br>Apache cTAKES is a Natural Language Processing (NLP) platform for clinical text. | | Established |
| 272 | yuchenlin/rebiber<br>A simple tool to update bib entries with their official information (e.g.,... | | Established |
| 273 | polm/cutlet<br>Japanese to romaji converter in Python | | Established |
| 274 | kakaobrain/word2word<br>Easy-to-use word-to-word translations for 3,564 language pairs. | | Established |
| 275 | jenojp/negspacy<br>spaCy pipeline object for negating concepts in text | | Established |
| 276 | stanford-oval/genie-toolkit<br>The Genie open source kit for voice assistant (formerly known as Almond) | | Established |
| 277 | ikegami-yukino/pymlask<br>Emotion analyzer for Japanese text | | Established |
| 278 | proycon/folia<br>FoLiA: Format for Linguistic Annotation - FoLiA is a rich XML-based... | | Established |
| 279 | KoichiYasuoka/esupar<br>Tokenizer POS-Tagger and Dependency-parser with BERT/RoBERTa/DeBERTa/GPT... | | Established |
| 280 | graykode/toeicbert<br>TOEIC(Test of English for International Communication) solving using... | | Established |
| 281 | guillaume-be/rust-bert<br>Rust native ready-to-use NLP pipelines and transformer-based models (BERT,... | | Established |
| 282 | FraBle/python-duckling<br>Python wrapper for wit.ai's Duckling Clojure library | | Established |
| 283 | panggi/pujangga<br>Pujangga - Indonesian Natural Language Processing Tool with REST API, an... | | Established |
| 284 | bcgov/tno<br>Today's News Online (TNO) is a news aggregation system that takes in news... | | Established |
| 285 | web-arena-x/webarena<br>Code repo for "WebArena: A Realistic Web Environment for Building Autonomous Agents" | | Established |
| 286 | KRLabsOrg/rulechef<br>Learn rule-based models from examples using LLM-powered synthesis. Replace... | | Established |
| 287 | Systemcluster/kitoken<br>Fast and versatile tokenizer for language models, compatible with... | | Established |
| 288 | vulnerability-lookup/VulnTrain<br>A tool to generate datasets and models based on vulnerabilities descriptions... | | Established |
| 289 | fbilhaut/gline-rs<br>Inference engine for GLiNER models, in Rust | | Established |
| 290 | indix/whatthelang<br>Lightning Fast Language Prediction 🚀 | | Established |
| 291 | SamEdwardes/spacytextblob<br>A TextBlob sentiment analysis pipeline component for spaCy. | | Established |
| 292 | HzaCode/OneCite<br>📚 An intelligent toolkit to automatically parse, complete, and format... | | Established |
| 293 | sugarme/tokenizer<br>NLP tokenizers written in Go language | | Established |
| 294 | explosion/spacy-experimental<br>🧪 Cutting-edge experimental spaCy components and features | | Established |
| 295 | gutfeeling/word_forms<br>Accurately generate all possible forms of an English word e.g "election" -->... | | Established |
| 296 | alvinwan/timefhuman<br>Extract datetimes and durations from natural language text as Python... | | Established |
| 297 | jbesomi/texthero<br>Text preprocessing, representation and visualization from zero to hero. | | Established |
| 298 | bretttolbert/verbecc<br>Verbe Complete Conjugator (verbecc) supports Catalan, Spanish, French,... | | Established |
| 299 | Blake-Madden/OleanderStemmingLibrary<br>Porter stemming library (C++) | | Established |
| 300 | BramVanroy/spacy_conll<br>Pipeline component for spaCy (and other spaCy-wrapped parsers such as... | | Established |