gregversteeg/corex_topic

Hierarchical unsupervised and semi-supervised topic models for sparse count data with CorEx

49
/ 100
Emerging

This tool helps you uncover key themes and topics within large collections of text documents, like news articles, customer feedback, or research papers. You provide your documents and an optional list of important keywords, and it outputs a set of topics, the words most associated with each, and which documents belong to each topic. It's designed for researchers, analysts, or anyone who needs to make sense of unstructured text data without manually reading every piece.

640 stars. No commits in the last 6 months.

Use this if you need to understand the main subjects being discussed across many documents and want to guide the topic discovery process using specific words or phrases you already know are important.

Not ideal if you're looking for a simple keyword extraction tool or don't have sparse count data as input.

text-analysis document-clustering qualitative-research content-discovery information-extraction
Stale 6m No Package No Dependents
Maintenance 0 / 25
Adoption 10 / 25
Maturity 16 / 25
Community 23 / 25

How are scores calculated?

Stars

640

Forks

118

Language

Python

License

Apache-2.0

Last pushed

Mar 22, 2021

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/ml-frameworks/gregversteeg/corex_topic"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.