Word Embedding Methods NLP Tools
Tools, implementations, and evaluations of word embedding algorithms and techniques (Word2Vec, GloVe, PPMI, etc.). Does NOT include embedding applications for downstream tasks, multimodal embeddings, or language model embeddings.
There are 59 word embedding methods tools tracked. 2 score above 50 (established tier). The highest-rated is dselivanov/text2vec at 55/100 with 870 stars.
Get all 59 projects as JSON
curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=nlp&subcategory=word-embedding-methods&limit=20"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
| # | Tool | Score | Tier |
|---|---|---|---|
| 1 |
dselivanov/text2vec
Fast vectorization, topic modeling, distances and GloVe word embeddings in R. |
|
Established |
| 2 |
vzhong/embeddings
Fast, DB Backed pretrained word embeddings for natural language processing. |
|
Established |
| 3 |
dccuchile/spanish-word-embeddings
Spanish word embeddings computed with different methods and from different corpora |
|
Emerging |
| 4 |
ncbi-nlp/BioSentVec
BioWordVec & BioSentVec: pre-trained embeddings for biomedical words and sentences |
|
Emerging |
| 5 |
ibrahimsharaf/doc2vec
:notebook: Long(er) text representation and classification using Doc2Vec embeddings |
|
Emerging |
| 6 |
avidale/compress-fasttext
Tools for shrinking fastText models (in gensim format) |
|
Emerging |
| 7 |
iarroyof/sentence_embedding
A sentence embedding method based on weighted series |
|
Emerging |
| 8 |
awslabs/sagemaker-privacy-for-nlp
A solution that helps apply a privacy preserving mechanism to NLP data,... |
|
Emerging |
| 9 |
Kekkodf/pypantera
A Python Package for NLP obfuscation using Differential Privacy |
|
Emerging |
| 10 |
rguthrie3/MorphologicalPriorsForWordEmbeddings
Code for EMNLP 2016 paper: Morphological Priors for Probabilistic Word Embeddings |
|
Emerging |
| 11 |
WorksApplications/chiVe
Japanese word embedding with Sudachi and NWJC ๐ฟ |
|
Emerging |
| 12 |
YuriyGuts/thrones2vec
Using Word2Vec to explore semantic similarities between the entities of "A... |
|
Emerging |
| 13 |
DanilBaibak/Harry_Potter_vs_Word2Vec
This is the example of analysing corpus of texts using Word2Vec |
|
Emerging |
| 14 |
amitvikramraj/Medical-Embeddings-and-Clinical-Trial-Search-Engine
The Project aims to train SkipGram and FastText Models on COVID-19 Clinical... |
|
Emerging |
| 15 |
schoennenbeck/swem
pytorch implementation of the simple word embedding model. |
|
Emerging |
| 16 |
CLARIN-PL/embeddings
Embeddings: State-of-the-art Text Representations for Natural Language... |
|
Emerging |
| 17 |
jeffrichardchemistry/WordFP
A new way to encode words and similarity calculate |
|
Emerging |
| 18 |
mkearney/wactor
Word Factor Vectors |
|
Experimental |
| 19 |
Tuanpham1994/Word-embedding-and-prediction
Word embedding and prediction |
|
Experimental |
| 20 |
seyedsaeidmasoumzadeh/Binary-Text-Classification-Doc2vec-SVM
A Python implementation of a binary text classifier using Doc2Vec and SVM |
|
Experimental |
| 21 |
thoppe/transorthogonal-linguistics
Uses a distributed word representation to finds words along the hyperchord... |
|
Experimental |
| 22 |
shimo-lab/sembei
:rice_cracker: ๅ่ชๅๅฒใ็ต็ฑใใชใๅ่ชๅใ่พผใฟ :rice_cracker: |
|
Experimental |
| 23 |
SigSegvSquad/WordLink-PharmaSearch
A Natural Language Search Enabled for Pharmaceutical research data. We aim... |
|
Experimental |
| 24 |
Turkish-Word-Embeddings/Word-Embeddings-Repository-for-Turkish
Code for "A Comprehensive Analysis of Static Word Embeddings for Turkish".... |
|
Experimental |
| 25 |
viveksck/simplicity
Code and Data for Simple Models for Word Formation in English Slang |
|
Experimental |
| 26 |
EQTPartners/pause
๐ PAUSE (Positive and Annealed Unlabeled Sentence Embedding), accepted by... |
|
Experimental |
| 27 |
digitalprk/north_korean_embeddings
Word2Vec Word Vectors trained on a North Korean Corpus / ์กฐ์ ์ด (๋ถํ์ด) ๋จ์ด ์๋ฒ ๋ฉ |
|
Experimental |
| 28 |
ispras-texterra/word-embeddings-eval-hy
Pre-trained fastText, word2vec, GloVe embeddings for the Armenian language... |
|
Experimental |
| 29 |
vaskonov/burvec
Word Embeddings for Low Resource Languages: The Case of Buryat |
|
Experimental |
| 30 |
unixpickle/wordembed
Word embeddings for natural language processing |
|
Experimental |
| 31 |
SigSegvSquad/WordLink
In this project we try to establish a concrete mathematical relation between... |
|
Experimental |
| 32 |
AsadiAhmad/Word-Embedding
Word Embeding with Simple model, w2v, Simple RNN, LSTM |
|
Experimental |
| 33 |
AsadiAhmad/Word-Embeding-CNN
Word Embeding with CNN |
|
Experimental |
| 34 |
brandonyph/Introduction-to-Word-Embedding-in-R
This page serve as the repository for the script file I used in my... |
|
Experimental |
| 35 |
helboukkouri/embedding-visualization
This is a project for visualizing word embeddings based on the work of... |
|
Experimental |
| 36 |
shubham0204/glove-android
Power of GloVe word embeddings in Android |
|
Experimental |
| 37 |
akash18tripathi/Word-Embeddings
This GitHub repository contains implementations of three popular word... |
|
Experimental |
| 38 |
abdulsalam-bande/swifty
This is a work to improve molecular docking speed. Normally docking a ligand... |
|
Experimental |
| 39 |
RonyAbecidan/PrivateWordEmbeddings
Study of the paper "Differentially Private Representation for NLP" |
|
Experimental |
| 40 |
alvations/vegetables
Collection of Repackaged Word Embeddings |
|
Experimental |
| 41 |
Rajspeaks/Deep-Learning-Approach-to-Bengali-Text-Visualization-using-Word2Vec-Model
This repository consists of Bengali Text-Visualization using Word2Vec Model.... |
|
Experimental |
| 42 |
CodeBoyPhilo/VocabOverfit
VocabOverfit: adopting the concept of embedding in memorising vocabularies |
|
Experimental |
| 43 |
Ritika2001/Word-Embedding-Models-for-Subjectivity-Analysis
An Empirical Evaluation of Word Embedding Models for Subjectivity Analysis Tasks |
|
Experimental |
| 44 |
LeonardoEmili/Word-in-Context
Word-in-Context (WiC) as a binary classification task using static word... |
|
Experimental |
| 45 |
trongdang09/word-embeddings
Application of Word Embeddings Model in Natural Language Processing |
|
Experimental |
| 46 |
jreades/ph-tutorial-code
Code to accompany clustering and visualising documents with word embeddings tutorial. |
|
Experimental |
| 47 |
keivanipchihagh/simple-word-embedding
A simple and custom word embedding algorithm |
|
Experimental |
| 48 |
GunjanDhanuka/word2vec_vis
Semantic Word Embeddings Visualizer that has the option to train on your own... |
|
Experimental |
| 49 |
pikachumark/pushingPikachu
Pushing Pikachu is a project that uses a fine-tuned GloVe embedding to find... |
|
Experimental |
| 50 |
grusso98/sins_word_embeddings
7 sins diachronic analysis using CADE on W2V and GloVe embeddings |
|
Experimental |
| 51 |
jayeshk7/Intro-to-NLP
PyTorch implementations of word embeddings and language modelling. |
|
Experimental |
| 52 |
FaisalAhmedBijoy/Document-similarity-using-doc2vec-and-gensim
Document Similarity Measurement Using Doc2Vec and Gensim Library |
|
Experimental |
| 53 |
aditi184/Word-Meaning-in-Comparison
Comparing the same words with the same or different meaning with respect to... |
|
Experimental |
| 54 |
shimo-lab/Drug-Gene-Analogy
Predicting drugโgene relations via analogy tasks with word embeddings... |
|
Experimental |
| 55 |
marialymperaiou/visual-genome-embeddings
Visual Genome word embeddings on region descriptions |
|
Experimental |
| 56 |
andrea-gasparini/nlp-word-in-context-disambiguation
Word-in-Context (WiC) disambiguation experimenting with a word-level... |
|
Experimental |
| 57 |
mohammadataei93/word2vec-embedding-and-topic-clustering
build a word2vec model on hamshahri data and try to clustering embedded... |
|
Experimental |
| 58 |
memgonzales/semantle-word-embeddings
Recreation of Semantle (a word guessing game that gives the semantic... |
|
Experimental |
| 59 |
nabeelshan78/Practical-Word-Embeddings-PyTorch
A hands-on repository demonstrating word embeddings, featuring Word2Vec... |
|
Experimental |