All NLP Tools

13,598 tools ranked by quality score · Page 3 of 136

Showing 201–300 of 13,598
# Tool Score Tier
201 isaacus-dev/isaacus-python

A Python library for interacting with the Isaacus API.

57
Established
202 esbatmop/MNBVC

MNBVC(Massive Never-ending BT Vast Chinese...

57
Established
203 yohasebe/wp2txt

A command-line tool to extract plain text from Wikipedia dumps with category...

57
Established
204 SeanLee97/xmnlp

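xmnlp: provides Chinese word segmentation, part-of-speech tagging, named entity recognition, sentiment analysis, text correction, text-to-pinyin conversion, text summarization, radical extraction, sentence representation, and text similarity computation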

57
Established
205 sciknoworg/OntoAligner

OntoAligner: A Python Toolkit for Ontology Alignment...

57
Established
206 shibing624/dialogbot

dialogbot, provide search-based dialogue, task-based dialogue and generative...

57
Established
207 mirkosertic/FXDesktopSearch

A JavaFX-based desktop search application.

57
Established
208 stephenhky/PyShortTextCategorization

Various Algorithms for Short Text Mining

57
Established
209 sushil79g/Nepali_nlp

A Python-based library for NLP in the Nepali language

57
Established
210 explosion/spacy-streamlit

👑 spaCy building blocks and visualizers for Streamlit apps

57
Established
211 jalajthanaki/NLPython

This repository contains the code related to Natural Language Processing...

57
Established
212 paschmann/rasa-ui

Rasa UI is a frontend for the Rasa Framework

57
Established
213 KennethEnevoldsen/asent

Asent is a python library for performing efficient and transparent sentiment...

57
Established
214 houbb/sensitive-word

👮‍♂️ The sensitive word tool for Java. (Sensitive words / prohibited words / illegal words / profanity. A high-performance, DFA-based Java...

57
Established
215 CGCL-codes/naturalcc

NaturalCC: An Open-Source Toolkit for Code Intelligence

57
Established
216 NLP-LOVE/Introduction-NLP

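Detailed notes on "Introduction to Natural Language Processing", the new book by the author of HanLP! A conscientious piece of work: rather than a dry list of formulas, the book explains algorithms and models in plain, accessible language. Starting from basic concepts, it gradually introduces...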

57
Established
217 nltk/nltk_data

NLTK Data

57
Established
218 ropensci/googleLanguageR

R client for the Google Translation API, Google Cloud Natural Language API...

57
Established
219 chrislit/abydos

Abydos NLP/IR library for Python

57
Established
220 daac-tools/vibrato

🎤 vibrato: Viterbi-based accelerated tokenizer

57
Established
221 natasha/corus

Links to Russian corpora + Python functions for loading and parsing

57
Established
222 fastnlp/fastNLP

fastNLP: A Modularized and Extensible NLP Framework. Currently still in incubation.

57
Established
223 messense/jieba-rs

The Jieba Chinese Word Segmentation Implemented in Rust

57
Established
224 natasha/razdel

Rule-based token, sentence segmentation for Russian language

56
Established
225 bltlab/seqscore

SeqScore: Scoring for named entity recognition and other sequence labeling tasks

56
Established
226 messense/fasttext-serving

fastText model serving service

56
Established
227 bataak/dict-mn

Mongolian spellchecking dictionary

56
Established
228 mideind/GreynirServer

The greynir.is Icelandic natural language processing API and website.

56
Established
229 ines/spacy-js

🎀 JavaScript API for spaCy with Python REST API

56
Established
230 PaddlePaddle/ERNIE

The official repository for ERNIE 4.5 and ERNIEKit – its industrial-grade...

56
Established
231 snipsco/snips-nlu

Snips Python library to extract meaning from text

56
Established
232 lovit/KR-WordRank

A library that automatically extracts words/keywords from Korean text using unsupervised learning

56
Established
233 SimGus/Chatette

A powerful dataset generator for Rasa NLU, inspired by Chatito

56
Established
234 EdinburghNLP/awesome-hallucination-detection

List of papers on hallucination detection in LLMs.

56
Established
235 rudikershaw/whichx

A small, no dependencies, Naive Bayes Text Classifier for JavaScript

56
Established
236 FraBle/python-sutime

Python wrapper for Stanford CoreNLP's SUTime

56
Established
237 quadbio/cell-annotator

Automatically annotate cell types, consistently across samples.

56
Established
238 thunlp/OpenHowNet

Core Data of HowNet and OpenHowNet Python API

56
Established
239 PragatiVerma18/MLH-Quizzet

This is a smart Quiz Generator that generates a dynamic quiz from any...

56
Established
240 interpretml/interpret-text

A library that incorporates state-of-the-art explainers for text-based...

56
Established
241 staticdev/human-readable

Lib to make data intended for machines, readable to humans.

56
Established
242 RocketChat/hubot-natural

Natural Language Processing Chatbot for RocketChat

56
Established
243 delph-in/pydelphin

Python libraries for DELPH-IN

56
Established
244 preligens-lab/textnoisr

Adding random noise to a text dataset, and controlling very accurately the...

56
Established
245 centre-for-humanities-computing/DaCy

DaCy: The State of the Art Danish NLP pipeline using SpaCy

56
Established
246 R1j1t/contextualSpellCheck

✔️Contextual word checker for better suggestions (not actively maintained)

56
Established
247 sileod/tasknet

Easy modernBERT fine-tuning and multi-task learning

56
Established
248 openfactcheck-research/openfactcheck

An Open-source Factuality Evaluation Demo for LLMs

56
Established
249 yogeshhk/MiningResume

Text Mining certain fields from a resume

56
Established
250 zjunlp/OpenUE

[EMNLP 2020] OpenUE: An Open Toolkit of Universal Extraction from Text

56
Established
251 opencog/link-grammar

The CMU Link Grammar natural language parser

56
Established
252 425776024/nlpcda

One-click Chinese data augmentation package; NLP data augmentation, BERT data augmentation, EDA: pip install nlpcda

56
Established
253 openeventdata/mordecai

Full text geoparsing as a Python library

56
Established
254 fukuball/jieba-php

"Jieba" Chinese word segmentation: aiming to be the best PHP component for Chinese word segmentation. / "Jieba" (Chinese for "to stutter") Chinese...

56
Established
255 jidasheng/bi-lstm-crf

A PyTorch implementation of the BI-LSTM-CRF model.

56
Established
256 SWHL/AI-Competition-Collections

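A collection of AI competition experience posts and training/testing tips posts (gathering and organizing write-ups from various AI competitions)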

55
Established
257 brucewlee/lftk

[BEA @ ACL 2023] General-purpose tool for linguistic features extraction;...

55
Established
258 huspacy/huspacy

HuSpaCy: industrial-strength Hungarian natural language processing

55
Established
259 JohnSnowLabs/johnsnowlabs

Gateway into the John Snow Labs Ecosystem

55
Established
260 Rostlab/nalaf

NLP framework in Python for entity recognition and relationship extraction

55
Established
261 averbis/averbis-python-api

Conveniently access the REST API of Averbis products using Python

55
Established
262 proycon/pynlpl

PyNLPl, pronounced as 'pineapple', is a Python library for Natural Language...

55
Established
263 Extralit/extralit

Fast and accurate systemic data extraction with LLM assistance

55
Established
264 dselivanov/text2vec

Fast vectorization, topic modeling, distances and GloVe word embeddings in R.

55
Established
265 emres/turkish-deasciifier

Turkish deasciifier in Python based on Deniz Yüret's turkish-mode for Emacs

55
Established
266 bnosac/udpipe

R package for Tokenization, Parts of Speech Tagging, Lemmatization and...

55
Established
267 OpenNMT/Tokenizer

Fast and customizable text tokenization library with BPE and SentencePiece support

55
Established
268 shibing624/pytextclassifier

pytextclassifier is a toolkit for text classification....

55
Established
269 quickwit-oss/whichlang

A blazingly fast and lightweight language detection library for Rust

55
Established
270 carlosplanchon/betterhtmlchunking

BetterHTMLChunking is a Python library for intelligent HTML segmentation. It...

55
Established
271 apache/ctakes

Apache cTAKES is a Natural Language Processing (NLP) platform for clinical text.

55
Established
272 yuchenlin/rebiber

A simple tool to update bib entries with their official information (e.g.,...

55
Established
273 polm/cutlet

Japanese to romaji converter in Python

55
Established
274 kakaobrain/word2word

Easy-to-use word-to-word translations for 3,564 language pairs.

55
Established
275 jenojp/negspacy

spaCy pipeline object for negating concepts in text

55
Established
276 stanford-oval/genie-toolkit

The Genie open source kit for voice assistant (formerly known as Almond)

55
Established
277 ikegami-yukino/pymlask

Emotion analyzer for Japanese text

55
Established
278 proycon/folia

FoLiA: Format for Linguistic Annotation - FoLiA is a rich XML-based...

55
Established
279 KoichiYasuoka/esupar

Tokenizer, POS-tagger, and dependency parser with BERT/RoBERTa/DeBERTa/GPT...

55
Established
280 graykode/toeicbert

TOEIC(Test of English for International Communication) solving using...

55
Established
281 guillaume-be/rust-bert

Rust native ready-to-use NLP pipelines and transformer-based models (BERT,...

55
Established
282 FraBle/python-duckling

Python wrapper for wit.ai's Duckling Clojure library

55
Established
283 panggi/pujangga

Pujangga - Indonesian Natural Language Processing Tool with REST API, an...

55
Established
284 bcgov/tno

Today's News Online (TNO) is a news aggregation system that takes in news...

55
Established
285 web-arena-x/webarena

Code repo for "WebArena: A Realistic Web Environment for Building Autonomous Agents"

55
Established
286 KRLabsOrg/rulechef

Learn rule-based models from examples using LLM-powered synthesis. Replace...

55
Established
287 Systemcluster/kitoken

Fast and versatile tokenizer for language models, compatible with...

55
Established
288 vulnerability-lookup/VulnTrain

A tool to generate datasets and models based on vulnerabilities descriptions...

55
Established
289 fbilhaut/gline-rs

Inference engine for GLiNER models, in Rust

55
Established
290 indix/whatthelang

Lightning Fast Language Prediction 🚀

54
Established
291 SamEdwardes/spacytextblob

A TextBlob sentiment analysis pipeline component for spaCy.

54
Established
292 HzaCode/OneCite

📚 An intelligent toolkit to automatically parse, complete, and format...

54
Established
293 sugarme/tokenizer

NLP tokenizers written in Go language

54
Established
294 explosion/spacy-experimental

🧪 Cutting-edge experimental spaCy components and features

54
Established
295 gutfeeling/word_forms

Accurately generate all possible forms of an English word e.g "election" -->...

54
Established
296 alvinwan/timefhuman

Extract datetimes and durations from natural language text as Python...

54
Established
297 jbesomi/texthero

Text preprocessing, representation and visualization from zero to hero.

54
Established
298 bretttolbert/verbecc

Verbe Complete Conjugator (verbecc) supports Catalan, Spanish, French,...

54
Established
299 Blake-Madden/OleanderStemmingLibrary

Porter stemming library (C++)

54
Established
300 BramVanroy/spacy_conll

Pipeline component for spaCy (and other spaCy-wrapped parsers such as...

54
Established