bakrianoo/aravec

AraVec is a pre-trained distributed word representation (word embedding) open source project which aims to provide the Arabic NLP research community with free to use and powerful word embedding models.

/ 100

Emerging

This project offers pre-trained language models specifically for Arabic text analysis. It takes in Arabic words or phrases from social media (Twitter) or encyclopedic sources (Wikipedia) and outputs numerical representations (vectors) that capture their meaning and relationships. This is invaluable for computational linguists or researchers working with Arabic language data.

417 stars. No commits in the last 6 months.

Use this if you are a computational linguist or researcher who needs to understand the semantic relationships between Arabic words and phrases from large text corpora like Twitter or Wikipedia.

Not ideal if your Arabic text data comes from very specific or niche domains not represented in general social media or encyclopedic content.

Arabic-NLP computational-linguistics text-analysis semantic-modeling language-research

No License Stale 6m No Package No Dependents

Maintenance 0 / 25

Adoption 10 / 25

Maturity 8 / 25

Community 23 / 25

How are scores calculated?

Stars

417

Forks

Language

Jupyter Notebook

License

—

Related tools

ICLRandD/Case2Vec

A simple web application for searching Word2Vec embeddings derived from approximately 2,000 law...

Jur1cek/source2vec

Source code embeddings for various programming languages

campfireai/job2vec

Open source model weights for Campfire AI's semantic embedding of human occupations

aziyan99/sword2vec

An simple implementation of skip-gram word2vec

BlackKakapo/Romanian-Word-Embeddings

Romanian Word Embeddings. Here you can find pre-trained corpora of word embeddings. Current...

Explore Embedding Tools

All categories Trending Embeddings directory Insights