inejc/paragraph-vectors

:page_facing_up: A PyTorch implementation of Paragraph Vectors (doc2vec).

48
/ 100
Emerging

This tool helps you analyze collections of text documents, like articles or papers, by converting them into numerical representations. You input a CSV file where each row is a document, and it outputs a set of 'document vectors' — numerical codes that capture the meaning of each document. This is useful for researchers, data scientists, or anyone who needs to find patterns, similarities, or relationships within large text datasets.

415 stars. No commits in the last 6 months.

Use this if you need to transform a large collection of documents into numerical data for tasks like clustering, classification, or similarity search.

Not ideal if you are looking for a ready-to-use application with a graphical interface, as this requires command-line interaction and some technical setup.

text-analysis document-similarity information-retrieval natural-language-processing research-data-analysis
Stale 6m No Package No Dependents
Maintenance 0 / 25
Adoption 10 / 25
Maturity 16 / 25
Community 22 / 25

How are scores calculated?

Stars

415

Forks

75

Language

Python

License

MIT

Last pushed

Dec 08, 2022

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/ml-frameworks/inejc/paragraph-vectors"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.