agatan/yoin
A Japanese Morphological Analyzer written in pure Rust
When working with Japanese text, it can be hard to understand the individual parts of a sentence. This tool helps by taking raw Japanese sentences and breaking them down into individual words and their grammatical roles, like nouns or verbs. Anyone who needs to analyze Japanese text for research, content analysis, or language processing would find this useful.
No commits in the last 6 months.
Use this if you need to automatically break down Japanese sentences into their component words and identify their grammatical properties.
Not ideal if you need to analyze text in languages other than Japanese, or if you require advanced semantic understanding beyond morphological analysis.
Stars
26
Forks
1
Language
Rust
License
MIT
Category
Last pushed
Oct 25, 2019
Monthly downloads
17
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/nlp/agatan/yoin"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
EmilStenstrom/conllu
A CoNLL-U parser that takes a CoNLL-U formatted string and turns it into a nested python dictionary.
OpenPecha/Botok
🏷 བོད་ཏོག [pʰøtɔk̚] Tibetan word tokenizer in Python
zaemyung/sentsplit
A flexible sentence segmentation library using CRF model and regex rules
taishi-i/nagisa
A Japanese tokenizer based on recurrent neural networks
natasha/razdel
Rule-based token, sentence segmentation for Russian language