bakrianoo/aravec
AraVec is a pre-trained distributed word representation (word embedding) open source project which aims to provide the Arabic NLP research community with free to use and powerful word embedding models.
This project offers pre-trained language models specifically for Arabic text analysis. It takes in Arabic words or phrases from social media (Twitter) or encyclopedic sources (Wikipedia) and outputs numerical representations (vectors) that capture their meaning and relationships. This is invaluable for computational linguists or researchers working with Arabic language data.
417 stars. No commits in the last 6 months.
Use this if you are a computational linguist or researcher who needs to understand the semantic relationships between Arabic words and phrases from large text corpora like Twitter or Wikipedia.
Not ideal if your Arabic text data comes from very specific or niche domains not represented in general social media or encyclopedic content.
Stars
417
Forks
83
Language
Jupyter Notebook
License
—
Category
Last pushed
Apr 04, 2021
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/embeddings/bakrianoo/aravec"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Related tools
ICLRandD/Case2Vec
A simple web application for searching Word2Vec embeddings derived from approximately 2,000 law...
Jur1cek/source2vec
Source code embeddings for various programming languages
campfireai/job2vec
Open source model weights for Campfire AI's semantic embedding of human occupations
aziyan99/sword2vec
An simple implementation of skip-gram word2vec
BlackKakapo/Romanian-Word-Embeddings
Romanian Word Embeddings. Here you can find pre-trained corpora of word embeddings. Current...