riedlma/topictiling
TopicTiling is a text segmentation method that is based on LDA
This tool helps analyze long documents by breaking them down into smaller, coherent segments based on their main topics. You provide a collection of texts and a pre-computed topic model, and it outputs an XML file showing where the topic shifts occur in your documents. It's ideal for researchers or analysts working with extensive textual data.
No commits in the last 6 months.
Use this if you need to automatically identify topic boundaries within large text documents or collections, saving time on manual review.
Not ideal if you need to process non-Latin character languages without pre-segmenting the text, or if you don't have a pre-existing topic model.
Stars
43
Forks
10
Language
Java
License
GPL-3.0
Category
Last pushed
Apr 14, 2021
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/nlp/riedlma/topictiling"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
MIND-Lab/OCTIS
OCTIS: Comparing Topic Models is Simple! A python package to optimize and evaluate topic models...
i-dot-ai/themefinder
A topic modelling Python package for analysing one-to-many question-answer data.
andifunke/topic-labeling
The project proposes a framework to apply topic models on a text-corpus and eventually topic...
bab2min/tomotopy
Python package of Tomoto, the Topic Modeling Tool
bobxwu/TopMost
A Topic Modeling System Toolkit (ACL 2024 Demo)