maxoodf/word2vec

word2vec++ is a Distributed Representations of Words (word2vec) library and set of tools, written from scratch in C++11

45 / 100

Emerging

This project helps you understand the relationships between words within large text datasets. You provide a collection of text documents, and it generates a model that represents each word as a numerical vector. This model can then be used by researchers, data scientists, or linguists to explore semantic similarities and analogies between words.
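To make "semantic similarities and analogies" concrete, here is a minimal sketch of the arithmetic such queries typically rely on: cosine similarity between word vectors, and the vector offset `b - a + c` for analogies. It does not use the word2vec++ API. The names `cosine`, `toyModel`, `nearest`, and `analogy` are hypothetical helpers, and the 3-dimensional vectors are hand-picked for illustration rather than produced by training.

```cpp
#include <algorithm>
#include <cassert>
#include <cmath>
#include <cstddef>
#include <map>
#include <string>
#include <vector>

using Vec = std::vector<double>;

// Cosine similarity: dot(a, b) / (|a| * |b|).
// Returns 0 if either vector has zero length.
double cosine(const Vec& a, const Vec& b) {
    assert(a.size() == b.size());
    double dot = 0.0, na = 0.0, nb = 0.0;
    for (std::size_t i = 0; i < a.size(); ++i) {
        dot += a[i] * b[i];
        na += a[i] * a[i];
        nb += b[i] * b[i];
    }
    if (na == 0.0 || nb == 0.0) return 0.0;
    return dot / (std::sqrt(na) * std::sqrt(nb));
}

// Toy "model": hand-picked vectors, NOT output of a trained word2vec model.
std::map<std::string, Vec> toyModel() {
    return {
        {"king",  {0.9, 0.8, 0.1}},
        {"queen", {0.9, 0.1, 0.8}},
        {"man",   {0.1, 0.9, 0.1}},
        {"woman", {0.1, 0.1, 0.9}},
        {"apple", {0.0, 0.5, 0.5}},
    };
}

// Word whose vector is most similar to `query`, skipping words in `exclude`.
// Returns an empty string if no candidate remains.
std::string nearest(const std::map<std::string, Vec>& model,
                    const Vec& query,
                    const std::vector<std::string>& exclude) {
    std::string best;
    double bestScore = -2.0;  // below the minimum possible cosine of -1
    for (const auto& entry : model) {
        if (std::find(exclude.begin(), exclude.end(), entry.first) != exclude.end())
            continue;
        const double score = cosine(query, entry.second);
        if (score > bestScore) {
            bestScore = score;
            best = entry.first;
        }
    }
    return best;
}

// "a is to b as c is to ?": search near the point b - a + c.
// Throws std::out_of_range if a, b, or c is not in the model.
std::string analogy(const std::map<std::string, Vec>& model,
                    const std::string& a,
                    const std::string& b,
                    const std::string& c) {
    const Vec& va = model.at(a);
    const Vec& vb = model.at(b);
    const Vec& vc = model.at(c);
    Vec query(va.size());
    for (std::size_t i = 0; i < va.size(); ++i)
        query[i] = vb[i] - va[i] + vc[i];
    return nearest(model, query, {a, b, c});
}
```

With these toy vectors, `analogy(toyModel(), "man", "king", "woman")` returns `"queen"`. A real model would have far more words and dimensions, and its results depend on the training corpus.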

140 stars. No commits in the last 6 months.

Use this if you need to transform vast amounts of text into a numerical format that captures word meanings and relationships, enabling tasks like finding similar words or completing word analogies.

Not ideal if you are a non-developer and prefer a graphical user interface or a ready-to-use API for text analysis.

natural-language-processing computational-linguistics text-mining semantic-analysis data-science

Stale 6m, No Package, No Dependents

Maintenance 0 / 25

Adoption 10 / 25

Maturity 16 / 25

Community 19 / 25


Stars: 140

Forks

Language: C++

License: Apache-2.0

Higher-rated alternatives

shibing624/similarities

Similarities: a toolkit for similarity calculation and semantic search....

explosion/sense2vec

🦆 Contextually-keyed word vectors

chakki-works/chakin

Simple downloader for pre-trained word vectors

sebischair/Lbl2Vec

Lbl2Vec learns jointly embedded label, document and word vectors to retrieve documents with...

pdrm83/sent2vec

How to encode sentences in a high-dimensional vector space, a.k.a., sentence embedding.
