MaartenGr/VLAC

Vectors of Locally Aggregated Concepts

45
/ 100
Emerging

This tool helps researchers and data scientists analyze document collections by converting text into rich numerical representations. It takes raw text documents and pre-trained word embeddings, then processes them to produce detailed 'concept vectors' for each document. These vectors can then be used for tasks like document classification or similarity analysis.

No commits in the last 6 months. Available on PyPI.

Use this if you need a sophisticated way to represent the thematic content of documents as numerical data for machine learning models, going beyond simple keyword counts.

Not ideal if you're looking for a simple keyword search tool or don't have pre-trained word embeddings for your language.

document-analysis text-mining information-retrieval natural-language-processing data-science
Stale 6m
Maintenance 0 / 25
Adoption 5 / 25
Maturity 25 / 25
Community 15 / 25

How are scores calculated?

Stars

13

Forks

5

Language

Jupyter Notebook

License

MIT

Last pushed

Jun 17, 2024

Commits (30d)

0

Dependencies

2

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/embeddings/MaartenGr/VLAC"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.