amirivojdan/shekar
Simplifying Persian NLP for Modern Applications
Shekar helps people working with Persian text prepare it for analysis or publication. It takes raw, messy Persian text—from social media, web pages, or scanned documents—and cleans it up, making it consistent and grammatically correct according to official guidelines. This is for linguists, data analysts, content managers, or anyone needing to process Persian language data accurately.
Available on PyPI.
Use this if you need to reliably clean, standardize, and prepare Persian text for tasks like data analysis, content moderation, or publishing.
Not ideal if your primary need is for languages other than Persian, as its specialized tools are built specifically for Persian text.
Stars
61
Forks
4
Language
Python
License
MIT
Category
Last pushed
Mar 12, 2026
Commits (30d)
0
Dependencies
5
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/embeddings/amirivojdan/shekar"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Related tools
roshan-research/hazm
Persian NLP Toolkit
Dadmatech/DadmaTools
DadmaTools is a Persian NLP tools developed by Dadmatech Co.
GlobalMaksimum/sadedegel
A General Purpose NLP library for Turkish
GKalliatakis/Keras-VGG16-places365
Keras code and weights files for the VGG16-places365 and VGG16-hybrid1365 CNNs for scene classification
NC0DER/KeyphraseExtraction
Keyphrase Extraction Review