ARBML/tnkeeh

Arabic cleaning, normalization and segmentation library.

38
/ 100
Emerging

This library helps anyone working with Arabic text prepare it for analysis or machine learning. It takes raw, uncleaned Arabic text from various sources like files, social media, or web pages and processes it to remove noise, standardize characters, and segment sentences. Data scientists, computational linguists, or researchers focused on Arabic language processing would use this.

No commits in the last 6 months.

Use this if you need to clean, normalize, or segment Arabic text to improve the performance of your language models or analysis tools.

Not ideal if you are working with non-Arabic languages or primarily need advanced linguistic analysis beyond basic cleaning and segmentation.

Arabic NLP Text Preprocessing Computational Linguistics Data Cleaning Natural Language Processing
Stale 6m No Package No Dependents
Maintenance 0 / 25
Adoption 9 / 25
Maturity 16 / 25
Community 13 / 25

How are scores calculated?

Stars

74

Forks

9

Language

Python

License

MIT

Category

arabic-nlp-tools

Last pushed

Sep 28, 2023

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/nlp/ARBML/tnkeeh"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.