kevincobain2000/jProcessing

Japanese Natural Langauge Processing Libraries

/ 100

Emerging

This helps people working with Japanese text by breaking down sentences into individual words, converting text to its phonetic Katakana or Romaji pronunciations, and finding similarities between Japanese phrases. It takes raw Japanese text as input and produces structured linguistic information or phonetic representations, making it useful for linguists, language learners, or data analysts processing Japanese content.

148 stars. No commits in the last 6 months.

Use this if you need to analyze Japanese text at a granular level, convert Japanese characters to their phonetic spellings, or compare the similarity of different Japanese sentences.

Not ideal if your primary need is for advanced machine translation, speech recognition, or complex conversational AI in Japanese, as it focuses on foundational linguistic processing.

Japanese-language-processing text-analysis linguistics language-learning data-preparation

Stale 6m No Package No Dependents

Maintenance 0 / 25

Adoption 10 / 25

Maturity 16 / 25

Community 20 / 25

How are scores calculated?

Stars

148

Forks

Language

OpenEdge ABL

License

BSD-2-Clause

Higher-rated alternatives

EmilStenstrom/conllu

A CoNLL-U parser that takes a CoNLL-U formatted string and turns it into a nested python dictionary.

OpenPecha/Botok

🏷 བོད་ཏོག [pʰøtɔk̚] Tibetan word tokenizer in Python

taishi-i/nagisa

A Japanese tokenizer based on recurrent neural networks

zaemyung/sentsplit

A flexible sentence segmentation library using CRF model and regex rules

natasha/razdel

Rule-based token, sentence segmentation for Russian language

Explore NLP Tools

All categories Trending NLP directory Insights