Word Embedding Methods NLP Tools

Tools, implementations, and evaluations of word embedding algorithms and techniques (Word2Vec, GloVe, PPMI, etc.). Does NOT include embedding applications for downstream tasks, multimodal embeddings, or language model embeddings.

This directory tracks 59 word embedding tools. Two score above 50, which places them in the Established tier. The highest-rated is dselivanov/text2vec at 55/100 with 870 stars.

Get all 59 projects as JSON

curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=nlp&subcategory=word-embedding-methods&limit=20"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
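For scripted access, the same request can be issued from Python. The following is a minimal sketch using only the standard library: the endpoint and query parameters are copied from the curl command, while `build_quality_url` and `fetch_projects` are hypothetical helper names introduced here for illustration. The response schema is not described on this page, so the sketch returns the decoded JSON as-is. The sample request passes `limit=20`; whether a larger value returns all 59 projects in one call is an assumption you would need to verify.

```python
import json
from urllib.parse import urlencode
from urllib.request import urlopen

# Endpoint copied from the curl command on this page.
BASE_URL = "https://pt-edge.onrender.com/api/v1/datasets/quality"


def build_quality_url(domain="nlp", subcategory="word-embedding-methods", limit=20):
    """Return the request URL with the same query parameters as the curl example."""
    query = urlencode({"domain": domain, "subcategory": subcategory, "limit": limit})
    return f"{BASE_URL}?{query}"


def fetch_projects(limit=20, timeout=30):
    """Fetch and decode the JSON payload (hypothetical helper, not an official client).

    The response schema is not documented here, so the decoded object is
    returned unchanged rather than assuming particular field names.
    """
    with urlopen(build_quality_url(limit=limit), timeout=timeout) as response:
        return json.load(response)
```

Calling `fetch_projects()` performs one HTTP request, which counts against the keyless limit of 100 requests/day.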

# Tool Score Tier
1 dselivanov/text2vec

Fast vectorization, topic modeling, distances and GloVe word embeddings in R.

55
Established
2 vzhong/embeddings

Fast, DB-backed pretrained word embeddings for natural language processing.

53
Established
3 dccuchile/spanish-word-embeddings

Spanish word embeddings computed with different methods and from different corpora

49
Emerging
4 ncbi-nlp/BioSentVec

BioWordVec & BioSentVec: pre-trained embeddings for biomedical words and sentences

48
Emerging
5 ibrahimsharaf/doc2vec

📓 Long(er) text representation and classification using Doc2Vec embeddings

46
Emerging
6 avidale/compress-fasttext

Tools for shrinking fastText models (in gensim format)

46
Emerging
7 iarroyof/sentence_embedding

A sentence embedding method based on weighted series

42
Emerging
8 awslabs/sagemaker-privacy-for-nlp

A solution that helps apply a privacy preserving mechanism to NLP data,...

40
Emerging
9 Kekkodf/pypantera

A Python Package for NLP obfuscation using Differential Privacy

34
Emerging
10 rguthrie3/MorphologicalPriorsForWordEmbeddings

Code for EMNLP 2016 paper: Morphological Priors for Probabilistic Word Embeddings

34
Emerging
11 WorksApplications/chiVe

Japanese word embedding with Sudachi and NWJC 🌿

34
Emerging
12 YuriyGuts/thrones2vec

Using Word2Vec to explore semantic similarities between the entities of "A...

33
Emerging
13 DanilBaibak/Harry_Potter_vs_Word2Vec

An example of analysing a corpus of texts using Word2Vec

33
Emerging
14 amitvikramraj/Medical-Embeddings-and-Clinical-Trial-Search-Engine

The Project aims to train SkipGram and FastText Models on COVID-19 Clinical...

32
Emerging
15 schoennenbeck/swem

PyTorch implementation of the simple word embedding model.

31
Emerging
16 CLARIN-PL/embeddings

Embeddings: State-of-the-art Text Representations for Natural Language...

31
Emerging
17 jeffrichardchemistry/WordFP

A new way to encode words and calculate similarity

31
Emerging
18 mkearney/wactor

Word Factor Vectors

29
Experimental
19 Tuanpham1994/Word-embedding-and-prediction

Word embedding and prediction

29
Experimental
20 seyedsaeidmasoumzadeh/Binary-Text-Classification-Doc2vec-SVM

A Python implementation of a binary text classifier using Doc2Vec and SVM

29
Experimental
21 thoppe/transorthogonal-linguistics

Uses a distributed word representation to find words along the hyperchord...

29
Experimental
22 shimo-lab/sembei

🍘 Word embeddings that do not go through word segmentation 🍘

28
Experimental
23 SigSegvSquad/WordLink-PharmaSearch

Natural language search for pharmaceutical research data. We aim...

27
Experimental
24 Turkish-Word-Embeddings/Word-Embeddings-Repository-for-Turkish

Code for "A Comprehensive Analysis of Static Word Embeddings for Turkish"....

27
Experimental
25 viveksck/simplicity

Code and Data for Simple Models for Word Formation in English Slang

27
Experimental
26 EQTPartners/pause

๐ŸŠ PAUSE (Positive and Annealed Unlabeled Sentence Embedding), accepted by...

27
Experimental
27 digitalprk/north_korean_embeddings

Word2Vec word vectors trained on a North Korean corpus / Korean (North Korean) word embeddings

27
Experimental
28 ispras-texterra/word-embeddings-eval-hy

Pre-trained fastText, word2vec, GloVe embeddings for the Armenian language...

26
Experimental
29 vaskonov/burvec

Word Embeddings for Low Resource Languages: The Case of Buryat

26
Experimental
30 unixpickle/wordembed

Word embeddings for natural language processing

26
Experimental
31 SigSegvSquad/WordLink

In this project we try to establish a concrete mathematical relation between...

25
Experimental
32 AsadiAhmad/Word-Embedding

Word embedding with a simple model, w2v, simple RNN, and LSTM

23
Experimental
33 AsadiAhmad/Word-Embeding-CNN

Word embedding with CNN

23
Experimental
34 brandonyph/Introduction-to-Word-Embedding-in-R

This page serves as the repository for the script file I used in my...

23
Experimental
35 helboukkouri/embedding-visualization

This is a project for visualizing word embeddings based on the work of...

22
Experimental
36 shubham0204/glove-android

Power of GloVe word embeddings in Android

22
Experimental
37 akash18tripathi/Word-Embeddings

This GitHub repository contains implementations of three popular word...

22
Experimental
38 abdulsalam-bande/swifty

This is a work to improve molecular docking speed. Normally docking a ligand...

22
Experimental
39 RonyAbecidan/PrivateWordEmbeddings

Study of the paper "Differentially Private Representation for NLP"

22
Experimental
40 alvations/vegetables

Collection of Repackaged Word Embeddings

20
Experimental
41 Rajspeaks/Deep-Learning-Approach-to-Bengali-Text-Visualization-using-Word2Vec-Model

This repository consists of Bengali Text-Visualization using Word2Vec Model....

19
Experimental
42 CodeBoyPhilo/VocabOverfit

VocabOverfit: adopting the concept of embedding in memorising vocabularies

19
Experimental
43 Ritika2001/Word-Embedding-Models-for-Subjectivity-Analysis

An Empirical Evaluation of Word Embedding Models for Subjectivity Analysis Tasks

19
Experimental
44 LeonardoEmili/Word-in-Context

Word-in-Context (WiC) as a binary classification task using static word...

19
Experimental
45 trongdang09/word-embeddings

Application of Word Embeddings Model in Natural Language Processing

18
Experimental
46 jreades/ph-tutorial-code

Code to accompany clustering and visualising documents with word embeddings tutorial.

18
Experimental
47 keivanipchihagh/simple-word-embedding

A simple and custom word embedding algorithm

18
Experimental
48 GunjanDhanuka/word2vec_vis

Semantic Word Embeddings Visualizer that has the option to train on your own...

17
Experimental
49 pikachumark/pushingPikachu

Pushing Pikachu is a project that uses a fine-tuned GloVe embedding to find...

17
Experimental
50 grusso98/sins_word_embeddings

7 sins diachronic analysis using CADE on W2V and GloVe embeddings

12
Experimental
51 jayeshk7/Intro-to-NLP

PyTorch implementations of word embeddings and language modelling.

12
Experimental
52 FaisalAhmedBijoy/Document-similarity-using-doc2vec-and-gensim

Document Similarity Measurement Using Doc2Vec and Gensim Library

12
Experimental
53 aditi184/Word-Meaning-in-Comparison

Comparing the same words with the same or different meaning with respect to...

11
Experimental
54 shimo-lab/Drug-Gene-Analogy

Predicting drug–gene relations via analogy tasks with word embeddings...

11
Experimental
55 marialymperaiou/visual-genome-embeddings

Visual Genome word embeddings on region descriptions

11
Experimental
56 andrea-gasparini/nlp-word-in-context-disambiguation

Word-in-Context (WiC) disambiguation experimenting with a word-level...

11
Experimental
57 mohammadataei93/word2vec-embedding-and-topic-clustering

Build a word2vec model on Hamshahri data and try to cluster embedded...

10
Experimental
58 memgonzales/semantle-word-embeddings

Recreation of Semantle (a word guessing game that gives the semantic...

10
Experimental
59 nabeelshan78/Practical-Word-Embeddings-PyTorch

A hands-on repository demonstrating word embeddings, featuring Word2Vec...

10
Experimental