LaurentVeyssier/Topic-Modeling-and-Document-Categorization-using-Latent-Dirichlet-Allocation

Categorize documents per topics inferred by LDA algorithm

25
/ 100
Experimental

This project helps you sort through large collections of text, like news headlines or social media posts, to automatically discover the main themes or topics present. You input a large group of unstructured text documents, and it outputs a breakdown of the hidden topics within them, along with which documents relate to which topics. Anyone who needs to understand the overarching subjects in a vast amount of text data, such as market researchers analyzing customer feedback or journalists sifting through archives, would find this useful.

No commits in the last 6 months.

Use this if you have a large collection of text documents and want to automatically uncover the main, underlying themes without pre-defining categories.

Not ideal if you need to classify documents into very specific, pre-defined categories or require highly precise, fine-grained distinctions between document types.

content-analysis market-research information-discovery media-monitoring text-mining
No License Stale 6m No Package No Dependents
Maintenance 0 / 25
Adoption 4 / 25
Maturity 8 / 25
Community 13 / 25

How are scores calculated?

Stars

8

Forks

2

Language

Jupyter Notebook

License

Last pushed

Jan 23, 2021

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/nlp/LaurentVeyssier/Topic-Modeling-and-Document-Categorization-using-Latent-Dirichlet-Allocation"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.