ibrahimsharaf/doc2vec
:notebook: Long(er) text representation and classification using Doc2Vec embeddings
This tool helps you automatically categorize longer pieces of text, such as customer feedback or articles, based on their overall meaning. You provide text documents, and it tells you which category each document belongs to. Marketers, customer support managers, or researchers who need to sort large volumes of text data would find this useful.
109 stars. No commits in the last 6 months.
Use this if you need to classify documents or longer texts into predefined categories, like determining if a movie review is positive or negative.
Not ideal if you need to classify very short snippets of text or individual words, or if your text classification requires highly specialized domain knowledge without extensive training data.
Stars
109
Forks
42
Language
Python
License
MIT
Category
Last pushed
Jun 17, 2024
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/nlp/ibrahimsharaf/doc2vec"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
dselivanov/text2vec
Fast vectorization, topic modeling, distances and GloVe word embeddings in R.
vzhong/embeddings
Fast, DB Backed pretrained word embeddings for natural language processing.
dccuchile/spanish-word-embeddings
Spanish word embeddings computed with different methods and from different corpora
ncbi-nlp/BioSentVec
BioWordVec & BioSentVec: pre-trained embeddings for biomedical words and sentences
avidale/compress-fasttext
Tools for shrinking fastText models (in gensim format)