pooya-mohammadi/persian-spell-checker-kenlm
A complete instruction for training a Persian spell checker and a language model based on SymSpell and KenLM, respectively using Wikipedia dataset.
This project helps you create a custom spell checker and a language model specifically for the Persian language. You feed it a large amount of Persian text, like a Wikipedia dump, and it produces a dictionary for spell checking and a model that understands how likely certain Persian word sequences are. This is useful for anyone working with Persian text data, such as content creators, linguists, or data analysts who need to improve text quality or analyze language patterns.
No commits in the last 6 months.
Use this if you need to build a specialized spell checker or a language model for the Persian language from scratch, using a large text corpus.
Not ideal if you need a spell checker for a language other than Persian, or if you require an out-of-the-box solution without training a new model.
Stars
35
Forks
1
Language
Python
License
MIT
Category
Last pushed
Jul 20, 2022
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/nlp/pooya-mohammadi/persian-spell-checker-kenlm"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
jxmorris12/language_tool_python
a free python grammar checker 📝✅
languagetool-org/languagetool
Style and Grammar Checker for 25+ Languages
hunspell/hunspell
The most popular spellchecking library.
bakwc/JamSpell
Modern spell checking library - accurate, fast, multi-language
neuspell/neuspell
NeuSpell: A Neural Spelling Correction Toolkit