Korean Text Processing NLP Tools

Tools and libraries specifically for Korean language tokenization, morphological analysis, and text preprocessing. Does NOT include general multilingual NLP tools, language identification, or Korean-specific applications like sentiment analysis or named entity recognition.

There are 50 korean text processing tools tracked. 9 score above 50 (established tier). The highest-rated is lovit/soynlp at 69/100 with 984 stars. 1 of the top 10 are actively maintained.

Get all 50 projects as JSON

curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=nlp&subcategory=korean-text-processing&limit=20"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.

# Tool Score Tier
1 lovit/soynlp

한국어 자연어처리를 위한 파이썬 라이브러리입니다. 단어 추출/ 토크나이저 / 품사판별/ 전처리의 기능을 제공합니다.

69
Established
2 bab2min/kiwipiepy

Python API for Kiwi

67
Established
3 bab2min/Kiwi

Kiwi(지능형 한국어 형태소 분석기)

65
Established
4 hyunwoongko/kss

KSS: Korean String processing Suite

60
Established
5 shineware/KOMORAN

Korean Morphological Analyzer by shineware

59
Established
6 naver/claf

CLaF: Open-Source Clova Language Framework

54
Established
7 JDongian/python-jamo

Hangul syllable decomposition and synthesis using jamo.

54
Established
8 konlpy/konlpy

Python package for Korean natural language processing.

51
Established
9 haven-jeon/PyKoSpacing

Automatic Korean word spacing with Python

50
Established
10 lovit/soyspacing

띄어쓰기 오류 교정 라이브러리입니다. CRF 와 같은 머신러닝 알고리즘이 아닌, 직관적인 접근법으로 띄어쓰기를 교정합니다.

48
Emerging
11 open-korean-text/open-korean-text

Open Korean Text Processor - An Open-source Korean Text Processor

47
Emerging
12 rokoroku/node-twitter-korean-text

(Deprecated) use open-korean-text

46
Emerging
13 abdalimran/pykotokenizer

PyKoTokenizer is a Korean text tokenizer for Korean Natural Language...

43
Emerging
14 uosdmlab/spark-nkp

Natural Korean Processor for Apache Spark

42
Emerging
15 bage79/nlp4kor

Natural Language Processing for Korean with Deep Learning

42
Emerging
16 koshort/koshort

(deprecated) :cat: koshort is a Python package for Korean internet spoken...

39
Emerging
17 aws-samples/sm-kornlp

A collection of Korean NLP hands-on labs on Amazon SageMaker

37
Emerging
18 Kyubyong/KoParadigm

KoParadigm: Korean Inflectional Paradigm Generator

36
Emerging
19 pnuailab/parser

한국어 문장 분석 시스템 BCD-KL-Parser

35
Emerging
20 bab2min/kiwi-gui

C# API for Kiwi

35
Emerging
21 L0Z1K/para-Kor

Create paraphrasing korean sentence with GPT-3

35
Emerging
22 open-korean-text/open-korean-text-4clj

Open Korean Text Processor wrapper for Clojure

34
Emerging
23 QuoQA-NLP/KoQuillBot

✍️ Korean Paraphrasing Tool Using Round-trip Translation

34
Emerging
24 bit2r/bitNLP

Tools that support "Natural Language Processing" for Korean text analytics.

34
Emerging
25 fuzzythecat/awesome-spacer

Automatic Korean word spacing with TensorFlow 2.0 + Keras

34
Emerging
26 ttytu/UKTA-web

Unififed Korean Text Analyzer including morpheme analysis, lexical features,...

32
Emerging
27 seoyeon9646/KorSEC

KorSEC : Korean Space Error Correction

32
Emerging
28 Huffon/nlp-startups

국내 자연어 처리 기술을 연구 및 개발하는 스타트업 목록

32
Emerging
29 mcognetta/ThreeHotKoreanModeling

A repo for parameter-efficient Korean character-level language modeling.

31
Emerging
30 Seokii/Korean_NLP_Tutorial

한국어 자연어처리 튜토리얼

31
Emerging
31 A-baoYang/NLP-techniques-chinese

For learning. Collecting techniques of each step from knowledge graph...

31
Emerging
32 JoonkyuChoi/polyglot-ko-1.3b-lite

Lite Korean language model

31
Emerging
33 kyle-bong/K-TACC

문맥을 고려한 한국어 텍스트 데이터 증강

30
Emerging
34 nakosung/hangul-asm

Hangul disasm/asm

29
Experimental
35 iKnowLab-Projects/ko-flan

한국어 FLAN 데이터 구축과 모델 학습을 위한 프로젝트

29
Experimental
36 sangdee/kss-java

Korean Sentence Splitter

27
Experimental
37 shineware/tutorials

KOMORAN Tutorials

27
Experimental
38 pipidog/CNLP

A toolbox for Chinese Natural Language Processing

25
Experimental
39 shineware/RKOMORAN

RKOMORAN is KOMORAN wrapper for R users

22
Experimental
40 JAICHANGPARK/flutter_kiwi_nlp

Kiwi 기반 한국어 형태소 분석 Flutter 플러그인입니다. Native-first Flutter plugin for Korean...

22
Experimental
41 bit2r/bitTA

기능이 bitNLP로 이관되었습니다. bitNLP를 사용하시기 바랍니다.

21
Experimental
42 binjang/NIKL-dictionary-parser

Unofficial parser for NIKL Dictionary files.

20
Experimental
43 brandazine/elasticsearch-kiwi-analysis-plugin

Kiwi 형태소 분석기 ElasticSearch 플러그인 (Unofficial)

19
Experimental
44 takuti/hive-udf-tokenize_ko

Korean NLP on Hive

18
Experimental
45 oh-gnues-iohc/korean-qa-paraphrase

This repository contains datasets and training resources for paraphrasing...

18
Experimental
46 eubinean/politely

An explainable politeness styler for the Korean Language / 설명가능한 반-존대 변환기

16
Experimental
47 jaeyeongs/ElectraSpacer

Korean Word-Spacing with KoCharELECTRA

12
Experimental
48 dilohn/ch-en-scaffolding

research with machine learning to determine best scaffolding for bilingual students

11
Experimental
49 ryancahildebrandt/hanakotoba

Exploring 花言葉 in Japanese and other literary corpora

11
Experimental
50 hyenee/NLP-Wiki

자연어처리 관련 위키

10
Experimental