MagedSaeed/farasapy
A Python implementation of Farasa toolkit
This tool helps researchers and linguists analyze Arabic text by breaking it down into its core components. You input raw Arabic text, and it outputs processed text with tasks like segmentation, stemming, named entity recognition, part-of-speech tagging, diacritization, and lemmatization applied. It's designed for anyone working with computational linguistics or natural language processing of Arabic.
139 stars. No commits in the last 6 months.
Use this if you need to perform in-depth linguistic analysis on Arabic text for research purposes, such as preparing data for machine learning models or understanding language structure.
Not ideal if you need this for commercial applications, as the underlying toolkit's license strictly permits research use only.
Stars
139
Forks
23
Language
Python
License
MIT
Category
Last pushed
Sep 11, 2025
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/nlp/MagedSaeed/farasapy"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
CAMeL-Lab/camel_tools
A suite of Arabic natural language processing tools developed by the CAMeL Lab at New York...
PetrKorab/Arabica
Python package for text mining of time-series data
markuskiller/textblob-de
German language support for TextBlob.
adhaamehab/textblob-ar
Arabic support for textblob
ARBML/tkseem
Arabic Tokenization Library. It provides many tokenization algorithms.