monologg/KoCharELECTRA
Character-level Korean ELECTRA Model (syllable-level Korean ELECTRA)
This project is for anyone analyzing Korean text who needs to process the language at the character (syllable) level rather than by whole words. It takes Korean sentences or documents as input and produces a character-by-character representation, which suits NLP tasks where individual Korean syllables matter. Typical users are researchers, data scientists, and machine learning engineers building Korean NLP applications.
No commits in the last 6 months.
Use this if you need a pre-trained Korean language model that processes text at the syllable level, offering a fine-grained understanding of the language.
Not ideal if your application requires processing Hanja characters, as this model explicitly excludes them from its vocabulary.
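To illustrate what syllable-level processing means in practice, here is a minimal sketch of character-level tokenization. This is an assumption about the general approach, not the project's actual tokenizer (the repository ships its own tokenizer class); the function name `syllable_tokenize` is hypothetical.

```python
def syllable_tokenize(text):
    """Split text into individual characters (Hangul syllables),
    dropping whitespace, as a character-level vocabulary would.

    A simplified sketch of syllable-level tokenization; the real
    KoCharELECTRA tokenizer also handles special tokens and
    out-of-vocabulary characters such as Hanja.
    """
    return [ch for ch in text if not ch.isspace()]


# "한국어 모델" (Korean model) becomes five syllable tokens
print(syllable_tokenize("한국어 모델"))  # → ['한', '국', '어', '모', '델']
```

Each Hangul syllable block (e.g. 한) is a single Unicode code point, so plain character iteration already yields syllable units; a word-level tokenizer would instead emit 한국어 and 모델 as whole tokens.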
Stars: 54
Forks: 10
Language: Python
License: Apache-2.0
Category:
Last pushed: Jun 12, 2023
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/transformers/monologg/KoCharELECTRA"
Open to everyone: 100 requests/day, no key needed. Get a free key for 1,000/day.
Compare
Higher-rated alternatives
SKTBrain/KoBERT
Korean BERT pre-trained cased (KoBERT)
monologg/KoELECTRA
Pretrained ELECTRA Model for Korean
monologg/KoBERT-Transformers
KoBERT on 🤗 Huggingface Transformers 🤗 (with Bug Fixed)
VinAIResearch/PhoBERT
PhoBERT: Pre-trained language models for Vietnamese (EMNLP-2020 Findings)
KB-AI-Research/KB-ALBERT
A Korean ALBERT model specialized for the economy/finance domain, provided by KB Kookmin Bank