taesiri/PersianWordVectors
A set of pre-trained word vectors for Persian language
This project provides pre-trained word embeddings for the Persian language, enabling computers to understand the meaning and relationships between Persian words. It takes large amounts of Persian text and converts words into numerical representations, which can then be used in various applications. This is useful for anyone working with Persian language data, such as computational linguists, researchers, or data scientists building language-aware systems.
No commits in the last 6 months.
Use this if you need to perform tasks like text classification, sentiment analysis, or information retrieval for Persian text, and you want to leverage pre-existing knowledge of word meanings without training a model from scratch.
Not ideal if your primary goal is to analyze languages other than Persian or if you require highly specialized word representations trained on a very specific, niche Persian corpus not covered by general internet text.
Stars
15
Forks
2
Language
Jupyter Notebook
License
—
Category
Last pushed
Jul 19, 2023
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/nlp/taesiri/PersianWordVectors"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
amirshnll/Persian-Swear-Words
Persian Swear Dataset - you can use in your production to filter unwanted content. دیتاست کلمات...
sajjjadayobi/PersianQA
Persian (Farsi) Question Answering Dataset (+ Models)
aghasemi/ChronologicalPersianPoetryDataset
A chronological (up to the century in which the poet has lived) of Persian poetry, extracted...
miras-tech/MirasText
MirasText
BodduSriPavan-111/chandassu
Chandassu: First Python Library for Global Metrical Poetry