polm/unidic-py
UniDic packaged for installation via pip.
This project helps linguists, researchers, or anyone working with Japanese text analysis to break down contemporary written Japanese into its core components. It takes raw Japanese text as input and outputs detailed linguistic information for each word, including its part of speech, conjugation, lemma, pronunciation, and accent. This is ideal for natural language processing tasks requiring deep grammatical insight into Japanese.
109 stars. Used by 6 other packages. No commits in the last 6 months. Available on PyPI.
Use this if you need to perform in-depth linguistic analysis of contemporary written Japanese and require a comprehensive dictionary for tasks like morphological analysis, parsing, or language research.
Not ideal if you need a lightweight dictionary and have strict disk space limitations, as this package is quite large.
Stars
109
Forks
13
Language
Python
License
MIT
Category
NLP
Last pushed
Feb 26, 2025
Commits (30d)
0
Reverse dependents
6
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/nlp/polm/unidic-py"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
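The curl command above can also be issued from Python with only the standard library. A small sketch; the URL components match the endpoint shown above, but the response schema is not documented here, so the fetch is left as a commented example.

```python
# Sketch of calling the quality API from Python (stdlib only).
import json
import urllib.request

BASE = "https://pt-edge.onrender.com/api/v1/quality"

def quality_url(category: str, owner: str, repo: str) -> str:
    """Build the per-repository quality endpoint URL."""
    return f"{BASE}/{category}/{owner}/{repo}"

url = quality_url("nlp", "polm", "unidic-py")

# Uncomment to fetch (no API key needed, up to 100 requests/day):
# with urllib.request.urlopen(url, timeout=10) as resp:
#     data = json.load(resp)
#     print(data)
```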
Related tools
blmoistawinde/HarvestText
Text mining and preprocessing toolkit (text cleaning, new word discovery, sentiment analysis, entity recognition and linking, keyword extraction, knowledge extraction, syntactic parsing, etc.), using unsupervised or weakly supervised methods
huspacy/huspacy
HuSpaCy: industrial-strength Hungarian natural language processing
bnosac/udpipe
R package for Tokenization, Parts of Speech Tagging, Lemmatization and Dependency Parsing Based...
BramVanroy/spacy_conll
Pipeline component for spaCy (and other spaCy-wrapped parsers such as spacy-stanza and...
tanloong/neosca
L2SCA & LCA fork: cross-platform, GUI, without Java dependency