bloomberg/fast-noise-aware-topic-clustering

Research code and scripts used in the Silburt et al. (2021) EMNLP 2021 paper 'FANATIC: FAst Noise-Aware TopIc Clustering'

/ 100

Emerging

FANATIC helps you find hidden topics within large collections of text, even when your data is noisy or includes many irrelevant documents. It takes raw text data, like social media posts, and outputs clearly defined topic clusters, along with summaries of what each cluster contains. This is ideal for researchers or analysts who need to make sense of unstructured text.

No commits in the last 6 months.

Use this if you need to automatically group vast amounts of text into meaningful categories and identify noise or irrelevant content within your dataset.

Not ideal if you need an out-of-the-box solution with a graphical interface for general topic modeling without custom code.

text-analysis social-media-research unstructured-data content-categorization information-discovery

Stale 6m No Package No Dependents

Maintenance 0 / 25

Adoption 5 / 25

Maturity 16 / 25

Community 12 / 25

How are scores calculated?

Stars

Forks

Language

Python

License

Apache-2.0

Higher-rated alternatives

MIND-Lab/OCTIS

OCTIS: Comparing Topic Models is Simple! A python package to optimize and evaluate topic models...

i-dot-ai/themefinder

A topic modelling Python package for analysing one-to-many question-answer data.

andifunke/topic-labeling

The project proposes a framework to apply topic models on a text-corpus and eventually topic...

bab2min/tomotopy

Python package of Tomoto, the Topic Modeling Tool

bobxwu/TopMost

A Topic Modeling System Toolkit (ACL 2024 Demo)

Explore NLP Tools

All categories Trending NLP directory Insights