corydonbaylor/getwiki
getwiki provides an R wrapper for wikipedia's API, allowing users to easily import text data for analysis
Quickly import Wikipedia article text into R for analysis without manual copy-pasting or dealing with messy HTML. You input article titles or search terms and receive clean, structured text that integrates well with natural language processing workflows. This is for data analysts, researchers, and anyone needing to programmatically gather large amounts of information from Wikipedia.
No commits in the last 6 months.
Use this if you need to programmatically access and import text content from Wikipedia articles into your R environment for text analysis or data processing.
Not ideal if you are looking for advanced web scraping capabilities beyond Wikipedia, or if your primary environment is not R.
Stars
7
Forks
—
Language
R
License
MIT
Category
Last pushed
Oct 21, 2020
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/nlp/corydonbaylor/getwiki"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
quanteda/quanteda
An R package for the Quantitative Analysis of Textual Data
juliasilge/tidytext
Text mining using tidy tools :sparkles::page_facing_up::sparkles:
massimoaria/tall
Text Analysis for aLL
keyATM/keyATM
An R package for Keyword Assisted Topic Models
gagolews/stringi
Fast and Portable Character String Processing in R (with the Unicode ICU)