Korean Language Models

Pretrained transformer models specifically designed for Korean language processing, including BERT, ELECTRA, and specialized variants. Does NOT include general multilingual models, non-Korean language models, or downstream task-specific applications (unless they primarily showcase the Korean model architecture itself).

There are 33 korean language models tracked. 2 score above 50 (established tier). The highest-rated is SKTBrain/KoBERT at 53/100 with 1,407 stars.

Get all 33 projects as JSON

curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=transformers&subcategory=korean-language-models&limit=20"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.

# Model Score Tier
1 SKTBrain/KoBERT

Korean BERT pre-trained cased (KoBERT)

53
Established
2 monologg/KoELECTRA

Pretrained ELECTRA Model for Korean

51
Established
3 monologg/KoBERT-Transformers

KoBERT on ๐Ÿค— Huggingface Transformers ๐Ÿค— (with Bug Fixed)

47
Emerging
4 VinAIResearch/PhoBERT

PhoBERT: Pre-trained language models for Vietnamese (EMNLP-2020 Findings)

47
Emerging
5 KB-AI-Research/KB-ALBERT

KB๊ตญ๋ฏผ์€ํ–‰์—์„œ ์ œ๊ณตํ•˜๋Š” ๊ฒฝ์ œ/๊ธˆ์œต ๋„๋ฉ”์ธ์— ํŠนํ™”๋œ ํ•œ๊ตญ์–ด ALBERT ๋ชจ๋ธ

46
Emerging
6 ymcui/MacBERT

Revisiting Pre-trained Models for Chinese Natural Language Processing (MacBERT)

44
Emerging
7 monologg/KoBERT-KorQuAD

Korean MRC (KorQuAD) with KoBERT

44
Emerging
8 monologg/DistilKoBERT

Distillation of KoBERT from SKTBrain (Lightweight KoBERT)

42
Emerging
9 Beomi/KcELECTRA

๐Ÿค— Korean Comments ELECTRA: ํ•œ๊ตญ์–ด ๋Œ“๊ธ€๋กœ ํ•™์Šตํ•œ ELECTRA ๋ชจ๋ธ

41
Emerging
10 monologg/korean-hate-speech-koelectra

Bias, Hate classification with KoELECTRA ๐Ÿ‘ฟ

40
Emerging
11 monologg/KoCharELECTRA

Character-level Korean ELECTRA Model (์Œ์ ˆ ๋‹จ์œ„ ํ•œ๊ตญ์–ด ELECTRA)

40
Emerging
12 monologg/KoBigBird

๐Ÿฆ… Pretrained BigBird Model for Korean (up to 4096 tokens)

40
Emerging
13 thevasudevgupta/bigbird

Google's BigBird (Jax/Flax & PyTorch) @ ๐Ÿค—Transformers

40
Emerging
14 monologg/KoELECTRA-Pipeline

Transformers Pipeline with KoELECTRA

37
Emerging
15 toriving/text-classification-transformers

Easy text classification for everyone : Bert based models via Huggingface...

37
Emerging
16 monologg/HanBert-Transformers

HanBert on ๐Ÿค— Huggingface Transformers ๐Ÿค—

36
Emerging
17 bayartsogt-ya/albert-mongolian

ALBERT trained on Mongolian text corpus

34
Emerging
18 sajjjadayobi/ParsBigBird

Persian Bert For Long-Range Sequences

33
Emerging
19 SciCrunch/bio_electra

Bio-Electra - Small and efficient discriminatively pre-trained language...

31
Emerging
20 yejoon-lee/kr3

KR3: Korean Restaurant Review with Ratings / Experiments on...

25
Experimental
21 oneonlee/KoAirBERT

๐Ÿค— ํ•ญ๊ณต ์•ˆ์ „ ๋„๋ฉ”์ธ์— ํŠนํ™”๋œ ํ•œ๊ตญ์–ด BERT ๋ชจ๋ธ โœˆ๏ธ

25
Experimental
22 codegram/calbert

Catalan ALBERT (A Lite BERT for self-supervised learning of language representations)

23
Experimental
23 Nikki-oo7/pos-tagger

Part-of-Speech Tagger implemented in PyTorch using BiLSTM and Transformer models.

21
Experimental
24 edoost/pert

Persian Ezafe Recognition Using Transformers and Its Role in Part-Of-Speech Tagging

20
Experimental
25 HRSadeghi/Joint_Comma_and_Kasreh_Recognizer

In this repository, we provide a joint neural model based on BERT and two...

19
Experimental
26 phanxuanphucnd/CoBERTa

CoBERTa is a pre-trained models are the pre-trained language models for...

19
Experimental
27 ilos-vigil/bigbird-small-indonesian

Lighweight Indonesian language model for long sequence.

19
Experimental
28 bab2min/kiwi-farm

Kiwi ํ˜•ํƒœ์†Œ ๋ถ„์„๊ธฐ๋ฅผ ํ™œ์šฉํ•œ ๋”ฅ๋Ÿฌ๋‹ ์–ธ์–ด ๋ชจ๋ธ ์‹คํ—˜์‹ค

16
Experimental
29 amanaser/BabyLM-ELECTRA-Pre-training

BabyLM ELECTRA Pre-training on NVIDIA L40 GPU Cluster.

13
Experimental
30 vanhai1231/phobert-vi-comment

Finetune mรด hรฌnh PhoBERT cho phรขn loแบกi comment trรชn khรดng gian mแบกng

12
Experimental
31 Quasar-Kim/kc-moe

ํ•œ๊ตญ์–ด ๋Œ“๊ธ€ ๋ฐ์ดํ„ฐ์…‹์— ํ›ˆ๋ จ์‹œํ‚จ Pretrained MoE(Mixture-of-Experts) ๋ชจ๋ธ

12
Experimental
32 davitjnz/electra-ka

BERT / ELETRA model for Georgian Language

11
Experimental
33 rrayhka/pos-tagger

POS Tagger menggunakan model dari Hugging Face untuk melakukan tagging...

11
Experimental