yoshoku/suika
Suika 🍉 is a Japanese morphological analyzer written in pure Ruby
This tool helps Ruby developers process Japanese text by breaking down sentences into individual words and analyzing their grammatical roles. You input raw Japanese sentences, and it outputs each word with its part of speech, base form, and pronunciation. This is for software developers building applications that need to understand or work with Japanese language data.
Use this if you are a Ruby developer and need to perform detailed grammatical analysis of Japanese text within your application.
Not ideal if you work in a programming language other than Ruby or if you are not a software developer.
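The input and output flow described above can be sketched in Ruby. This is a minimal sketch under stated assumptions, not the library's documented behavior: `Suika::Tagger#parse` is the entry point as recalled from the gem's README, and the line layout (surface form, a tab, then comma-separated features) is an assumption. The helper names `split_token_line` and `analyze_with_suika` and the sample line are hypothetical.

```ruby
# Assumption: each analyzer output line is "surface<TAB>feature,feature,...".
# Split one such line into the surface form and its feature list.
def split_token_line(line)
  surface, features = line.to_s.chomp.split("\t", 2)
  { surface: surface, features: features.to_s.split(/,\s*/) }
end

# Assumption: Suika::Tagger.new.parse(text) returns one string per token.
# Requires the suika gem; the require is kept inside the method so the
# helper above can be used without the gem installed.
def analyze_with_suika(text)
  require 'suika'
  Suika::Tagger.new.parse(text).map { |line| split_token_line(line) }
end

# Hypothetical sample line in the assumed layout (not captured from a real run).
sample = "すもも\t名詞,一般,*,*,*,*,すもも,スモモ,スモモ"
token = split_token_line(sample)
puts "#{token[:surface]} -> #{token[:features].first}"
```

With the gem installed, `analyze_with_suika('すもももももももものうち')` would return one hash per token. Check the gem's README for the actual order and meaning of the feature fields before relying on specific positions.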
Stars: 51
Forks: 1
Language: Ruby
License: BSD-3-Clause
Category:
Last pushed: Jan 02, 2026
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/nlp/yoshoku/suika"
Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000 requests/day.
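The same request can be made from Ruby with the standard library. This is a rough sketch: the URL pattern is taken from the curl command above, while the helper names `quality_uri` and `fetch_quality` are hypothetical. The page does not state the response format, so the body is returned as a raw string.

```ruby
require 'net/http'
require 'uri'

# Base path copied from the curl example; "nlp" is the segment it uses.
API_BASE = 'https://pt-edge.onrender.com/api/v1/quality'

# Build the endpoint URL for a given repository.
def quality_uri(owner, repo, category: 'nlp')
  URI("#{API_BASE}/#{category}/#{owner}/#{repo}")
end

# Fetch the raw response body. Response#value raises unless the status is 2xx.
def fetch_quality(owner, repo)
  response = Net::HTTP.get_response(quality_uri(owner, repo))
  response.value
  response.body
end

puts quality_uri('yoshoku', 'suika')
```

Calling `fetch_quality('yoshoku', 'suika')` performs the network request and counts against the daily request limit.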
Higher-rated alternatives
EmilStenstrom/conllu
A CoNLL-U parser that takes a CoNLL-U formatted string and turns it into a nested Python dictionary.
OpenPecha/Botok
🏷 བོད་ཏོག [pʰøtɔk̚] Tibetan word tokenizer in Python
zaemyung/sentsplit
A flexible sentence segmentation library using a CRF model and regex rules
taishi-i/nagisa
A Japanese tokenizer based on recurrent neural networks
natasha/razdel
Rule-based token and sentence segmentation for the Russian language