TimSchopf/KeyphraseVectorizers

Set of vectorizers that extract keyphrases with part-of-speech patterns from a collection of text documents and convert them into a document-keyphrase matrix.

44
/ 100
Emerging

This tool helps you analyze collections of text documents to identify important multi-word phrases. It takes your raw text documents as input and outputs a structured table showing which keyphrases appear in each document, along with their frequencies. Anyone needing to understand the core topics across many texts, such as researchers analyzing papers or marketers reviewing customer feedback, would find this useful.

267 stars. No commits in the last 6 months.

Use this if you need to automatically extract grammatically correct, multi-word keyphrases from documents and quantify their presence.

Not ideal if you're looking for single keywords or if your main goal is simply to count individual words.

text-analysis document-categorization information-retrieval market-research content-analysis
Stale 6m No Package No Dependents
Maintenance 0 / 25
Adoption 10 / 25
Maturity 16 / 25
Community 18 / 25

How are scores calculated?

Stars

267

Forks

38

Language

Python

License

BSD-3-Clause

Last pushed

Nov 08, 2024

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/nlp/TimSchopf/KeyphraseVectorizers"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.