btrkeks/jp-deinflector
A high-performance Rust crate for deinflecting Japanese words using perfect hash tables
This tool helps Japanese linguists, developers of dictionary applications, and language learners to process Japanese text by finding the base form of conjugated or inflected Japanese words. It takes an inflected Japanese word and outputs a list of possible root forms. This is useful for anyone building or using Japanese language tools.
Use this if you need to quickly find potential base forms of Japanese words for dictionary lookups or linguistic analysis, especially for high-performance applications.
Not ideal if you need to confirm that the deinflected word is a real word in the Japanese language, as this tool does not perform dictionary lookups.
Stars
7
Forks
—
Language
Rust
License
—
Category
Last pushed
Dec 03, 2025
Monthly downloads
17
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/nlp/btrkeks/jp-deinflector"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
EmilStenstrom/conllu
A CoNLL-U parser that takes a CoNLL-U formatted string and turns it into a nested python dictionary.
OpenPecha/Botok
🏷 བོད་ཏོག [pʰøtɔk̚] Tibetan word tokenizer in Python
zaemyung/sentsplit
A flexible sentence segmentation library using CRF model and regex rules
taishi-i/nagisa
A Japanese tokenizer based on recurrent neural networks
natasha/razdel
Rule-based token, sentence segmentation for Russian language