maxoodf/word2vec
word2vec++ is a Distributed Representations of Words (word2vec) library and tools implementation, written in C++11 from the scratch
This project helps you understand the relationships between words within large text datasets. You provide a collection of text documents, and it generates a model that represents each word as a numerical vector. This model can then be used by researchers, data scientists, or linguists to explore semantic similarities and analogies between words.
140 stars. No commits in the last 6 months.
Use this if you need to transform vast amounts of text into a numerical format that captures word meanings and relationships, enabling tasks like finding similar words or completing word analogies.
Not ideal if you are a non-developer and prefer a graphical user interface or a ready-to-use API for text analysis.
Stars
140
Forks
24
Language
C++
License
Apache-2.0
Category
Last pushed
Oct 14, 2023
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/embeddings/maxoodf/word2vec"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
shibing624/similarities
Similarities: a toolkit for similarity calculation and semantic search....
explosion/sense2vec
🦆 Contextually-keyed word vectors
chakki-works/chakin
Simple downloader for pre-trained word vectors
sebischair/Lbl2Vec
Lbl2Vec learns jointly embedded label, document and word vectors to retrieve documents with...
pdrm83/sent2vec
How to encode sentences in a high-dimensional vector space, a.k.a., sentence embedding.