azu/morpheme-match
match function that match token(形態素解析) with sentence.
This tool helps Japanese content creators and editors accurately check if specific words or phrases (tokens) exist within a sentence, even if there are variations in how they're written. You input a Japanese sentence and a list of tokens, and it tells you whether those tokens are present based on their grammatical structure. This is ideal for anyone who needs precise keyword detection in Japanese text.
No commits in the last 6 months. Available on npm.
Use this if you need to reliably detect specific Japanese words or phrases within sentences, going beyond simple string matching to account for grammatical variations.
Not ideal if you are working with languages other than Japanese, or if you only need basic string comparison without considering linguistic morphology.
Stars
14
Forks
3
Language
JavaScript
License
—
Category
Last pushed
Aug 23, 2022
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/nlp/azu/morpheme-match"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
EmilStenstrom/conllu
A CoNLL-U parser that takes a CoNLL-U formatted string and turns it into a nested python dictionary.
OpenPecha/Botok
🏷 བོད་ཏོག [pʰøtɔk̚] Tibetan word tokenizer in Python
taishi-i/nagisa
A Japanese tokenizer based on recurrent neural networks
zaemyung/sentsplit
A flexible sentence segmentation library using CRF model and regex rules
natasha/razdel
Rule-based token, sentence segmentation for Russian language