mojtaba-khallash/JHazm
A Java version of Hazm (Python library for digesting Persian text)
This tool helps researchers and data analysts process and understand Persian text by taking raw Persian documents or sentences and extracting structured linguistic information. It can clean text, break it into sentences and words, identify the root form of words, and assign grammatical tags to each word. This is ideal for anyone working with large collections of Persian text data.
No commits in the last 6 months.
Use this if you need to perform in-depth linguistic analysis on Persian text, such as for academic research, content analysis, or building language-aware applications.
Not ideal if your primary need is general-purpose text translation or basic keyword searching, or if you are working with languages other than Persian.
Stars
48
Forks
17
Language
Java
License
MIT
Category
Last pushed
Feb 26, 2022
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/nlp/mojtaba-khallash/JHazm"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
apache/opennlp
Apache OpenNLP
stanfordnlp/CoreNLP
CoreNLP: A Java suite of core NLP tools for tokenization, sentence segmentation, NER, parsing,...
stanfordnlp/python-stanford-corenlp
Python interface to CoreNLP using a bidirectional server-client interface.
dkpro/dkpro-core
Collection of software components for natural language processing (NLP) based on the Apache UIMA...
apache/opennlp-sandbox
Apache OpenNLP Sandbox