giocoal/reddit-tldr-summarizer-and-topic-modeling
Extreme Extractive Text Summarization and Topic Modeling (using LSA and LDA techniques) over Reddit Posts from TLDRHQ dataset.
This project helps social media managers or community moderators quickly understand large volumes of Reddit posts and discussions. It takes raw Reddit post text and automatically generates a short, 'Too Long; Didn't Read' (TL;DR) summary, alongside identifying the main hidden topics within the conversations. This allows users to grasp key points and categorise content efficiently without manually reading every detail.
No commits in the last 6 months.
Use this if you need to rapidly summarize long Reddit posts and identify the core subjects being discussed across many threads.
Not ideal if you need highly nuanced, human-like summaries that interpret meaning beyond simply extracting key sentences, or if your content is from platforms other than Reddit.
Stars
8
Forks
1
Language
Python
License
MIT
Category
Last pushed
Jan 19, 2024
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/nlp/giocoal/reddit-tldr-summarizer-and-topic-modeling"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
cannlytics/cannlytics
🔥 Cannlytics = cannabis + analytics. Data pipelines, user interfaces, and the best statistics in...
kariemoorman/tiktok-analyzer
TikTok video scraping and multimodal content analysis tool.
rahulkumaran/merkalysis
A marketing tool that helps you to market your products using organic marketing. This tool can...
PhantomInsights/subreddit-analyzer
A comprehensive Data and Text Mining workflow for submissions and comments from any given public...
eaglewarrior/scrape_do_nlp
I have made a package which will extract google news and twitter tweets and do sentiment...