kiwiwu02/GroupBT_UM_Programming_Project
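This is an end-to-end Chinese news data analysis project. It starts by scraping articles from China News Service (中国新闻网) and Sina News (新浪新闻), then cleans and manages the data, analyzes it with interpretable metrics such as adjacent-character co-occurrence, pointwise mutual information (PMI), word frequency, and document mutual information, and outputs visualizations including heatmaps, bar charts, network graphs, and word clouds. The project emphasizes a reproducible code workflow and transparent metrics, compares the vocabulary and style of the two outlets, and supports follow-up topic discovery and content comparison.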
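The description names adjacent-character co-occurrence and PMI without showing how they are computed. The following is a minimal sketch of one common formulation, not the repository's actual code: the function name `adjacent_pair_pmi`, the `min_count` parameter, the restriction to the CJK range `\u4e00-\u9fff`, and the use of base-2 logarithms are all assumptions made for illustration.

```python
import math
import re
from collections import Counter

# Assumption: only runs of common CJK ideographs are analysed, so that
# punctuation, digits, and whitespace never form part of a character pair.
CJK_RUN = re.compile(r"[\u4e00-\u9fff]+")


def adjacent_pair_pmi(texts, min_count=1):
    """Return {(left_char, right_char): PMI} for adjacent character pairs.

    PMI(x, y) = log2( p(x, y) / (p(x) * p(y)) ), where p(x, y) is the share
    of all adjacent pairs equal to (x, y) and p(x) is the share of all
    characters equal to x. `min_count` (hypothetical parameter) drops pairs
    seen fewer times than the threshold.
    """
    char_counts = Counter()
    pair_counts = Counter()
    for text in texts:
        for run in CJK_RUN.findall(text):
            char_counts.update(run)
            # zip(run, run[1:]) yields each pair of neighbouring characters.
            pair_counts.update(zip(run, run[1:]))

    total_chars = sum(char_counts.values())
    total_pairs = sum(pair_counts.values())
    if total_chars == 0 or total_pairs == 0:
        return {}

    scores = {}
    for (left, right), n in pair_counts.items():
        if n < min_count:
            continue
        p_pair = n / total_pairs
        p_left = char_counts[left] / total_chars
        p_right = char_counts[right] / total_chars
        scores[(left, right)] = math.log2(p_pair / (p_left * p_right))
    return scores


if __name__ == "__main__":
    sample = ["新闻新闻", "中国新闻"]  # toy input, not project data
    ranked = sorted(adjacent_pair_pmi(sample).items(), key=lambda kv: -kv[1])
    for (left, right), score in ranked:
        print(left + right, round(score, 3))
```

Pairs with high PMI co-occur more often than their individual character frequencies would predict, which is why the measure is often used to surface candidate words in unsegmented Chinese text. On small corpora PMI tends to favor rare pairs, so a frequency threshold such as `min_count` is usually worth setting.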
Stars
1
Forks
—
Language
Jupyter Notebook
License
—
Category
Last pushed
Dec 25, 2025
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/nlp/kiwiwu02/GroupBT_UM_Programming_Project"
Open to everyone: 100 requests/day with no key needed. Get a free key for 1,000/day.
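The same request can be made from Python using only the standard library. This is a rough sketch under stated assumptions: the helper names `build_quality_url` and `fetch_quality` are hypothetical, the URL pattern is inferred from the single curl example above, and because the response format is not documented here the body is returned as raw text (decoded as UTF-8, also an assumption) rather than parsed.

```python
import urllib.request

# Base path taken from the curl example; treating the last two path
# segments as "<owner>/<repo>" is an assumption.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality/nlp"


def build_quality_url(owner, repo):
    """Build the endpoint URL for one repository (hypothetical helper)."""
    return f"{BASE_URL}/{owner}/{repo}"


def fetch_quality(owner, repo, timeout=10.0):
    """Fetch the raw response body as text; no response schema is assumed."""
    url = build_quality_url(owner, repo)
    with urllib.request.urlopen(url, timeout=timeout) as response:
        return response.read().decode("utf-8")
```

Calling `fetch_quality("kiwiwu02", "GroupBT_UM_Programming_Project")` issues the same GET request as the curl command, and each call counts against the daily request limit.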
Higher-rated alternatives
PhantomInsights/mexican-government-report
Text Mining on the 2019 Mexican Government Report, covering from extracting text from a PDF file...
AndreCNF/polids
Analysis of electoral manifestos, with the results presented through apps.
stdlib-js/datasets-sotu
State of the Union addresses by U.S. Presidents.
gyunggyung/National-Petition
Finding out what the public thinks by analyzing Blue House national petitions 📈🔬
NLP-UMUTeam/Spanish-PoliCorpus-2020
This dataset contains the code of the paper entitled Predicting Political Ideology from...