motazsaad/comparable-text-miner
Comparable documents miner: Arabic-English morphological analysis, text processing, n-gram feature extraction, POS tagging, dictionary translation, document alignment, corpus information, text classification, tf-idf computation, text similarity computation, HTML document cleaning
This tool is aimed at researchers and analysts who work with documents in both Arabic and English. It takes raw text or HTML documents as input and prepares them for linguistic analysis, including morphological analysis, part-of-speech tagging, and dictionary translation between the two languages. Its output supports cross-lingual tasks such as document alignment, text classification, and similarity computation.
No commits in the last 6 months.
Use this if you need to analyze, translate, and compare large collections of Arabic and English text documents, especially for research or cross-lingual data analysis.
Not ideal if you primarily need quick, conversational translation or are working with languages other than Arabic and English.
Stars: 35
Forks: 14
Language: Python
License: Apache-2.0
Category:
Last pushed: Apr 24, 2017
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/nlp/motazsaad/comparable-text-miner"
Open to everyone: 100 requests/day, no key needed. A free key raises the limit to 1,000/day.
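The same request can be made from Python. This is a minimal sketch using only the standard library. The helper names `quality_url` and `fetch_quality` are hypothetical. The category/owner/repo path layout is inferred from the single URL in the curl command, and the JSON response body is an assumption, since the page does not document the response format.

```python
import json
import urllib.request
from urllib.parse import quote

# Base path copied from the curl command above.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def quality_url(category: str, owner: str, repo: str) -> str:
    """Build the endpoint URL.

    Assumption: the path is <category>/<owner>/<repo>, inferred from
    the one example URL shown on the page.
    """
    parts = [quote(part, safe="") for part in (category, owner, repo)]
    return BASE_URL + "/" + "/".join(parts)


def fetch_quality(category: str, owner: str, repo: str, timeout: float = 10.0):
    """Send a GET request and decode the body.

    Assumption: the endpoint returns JSON; this is not confirmed by the page.
    """
    request = urllib.request.Request(
        quality_url(category, owner, repo),
        headers={"Accept": "application/json"},
    )
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))
```

Calling `fetch_quality("nlp", "motazsaad", "comparable-text-miner")` issues the same request as the curl command. With the keyless limit of 100 requests/day, it is worth caching results locally when checking many repositories.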
Higher-rated alternatives
islamAndAi/QURAN-NLP
Quran, Hadith, Translations, Tafaseer, Corpus Linguistics. Everything for NLP
yonatanlou/QumranNLP
Modern computational linguistics for the Dead Sea Scrolls
prakhar21/Automatic-Glossary-Generation
The project lets you extract glossary words and their definitions from a given piece of text...
ronenh24/bible_search_engine
Bible search engine incorporating natural language processing, deep learning, and machine learning.
ymorsi7/QuranicSentiment
Web app that provides relevant Quranic verses based on emotional states, combining sentiment...