PhantomInsights/mexican-government-report
Text Mining on the 2019 Mexican Government Report, covering from extracting text from a PDF file to plotting the results.
This project helps political analysts or researchers extract, clean, and analyze text from large government reports. It takes a PDF document, processes the raw text, and outputs structured data like tokens, entities, and sentiment scores in CSV files. This allows users to understand the key themes, sentiment, and named entities discussed within the report without manual reading.
476 stars. No commits in the last 6 months.
Use this if you need to systematically analyze the content of a long official document, like a government report, to identify key topics, sentiment, and named entities for research or policy analysis.
Not ideal if you need to extract data from highly complex PDFs with tables, images, or intricate layouts, or if your analysis requires real-time processing.
Stars
476
Forks
82
Language
Python
License
MIT
Category
Last pushed
Jan 22, 2020
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/nlp/PhantomInsights/mexican-government-report"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Related tools
AndreCNF/polids
Analysis of electoral manifestos and output of it through apps.
stdlib-js/datasets-sotu
State of the Union addresses by U.S. Presidents.
gyunggyung/National-Petition
청와대 국민청원 분석으로 국민의 생각 알아보기 📈🔬
NLP-UMUTeam/Spanish-PoliCorpus-2020
This dataset contains the code of the paper entitled Predicting Political Ideology from...
AutoViML/featurewiz_polars
New Polars implementation of the classic featurewiz MRMR algorithm. Created by Ram Seshadri....