ibrahimsharaf/doc2vec

:notebook: Long(er) text representation and classification using Doc2Vec embeddings

/ 100

Emerging

This tool helps you automatically categorize longer pieces of text, such as customer feedback or articles, based on their overall meaning. You provide text documents, and it tells you which category each document belongs to. Marketers, customer support managers, or researchers who need to sort large volumes of text data would find this useful.

109 stars. No commits in the last 6 months.

Use this if you need to classify documents or longer texts into predefined categories, like determining if a movie review is positive or negative.

Not ideal if you need to classify very short snippets of text or individual words, or if your text classification requires highly specialized domain knowledge without extensive training data.

text-categorization sentiment-analysis document-sorting customer-feedback-analysis content-moderation

Stale 6m No Package No Dependents

Maintenance 0 / 25

Adoption 9 / 25

Maturity 16 / 25

Community 21 / 25

How are scores calculated?

Stars

109

Forks

Language

Python

License

MIT

Higher-rated alternatives

dselivanov/text2vec

Fast vectorization, topic modeling, distances and GloVe word embeddings in R.

vzhong/embeddings

Fast, DB Backed pretrained word embeddings for natural language processing.

dccuchile/spanish-word-embeddings

Spanish word embeddings computed with different methods and from different corpora

ncbi-nlp/BioSentVec

BioWordVec & BioSentVec: pre-trained embeddings for biomedical words and sentences

avidale/compress-fasttext

Tools for shrinking fastText models (in gensim format)

Explore NLP Tools

All categories Trending NLP directory Insights