juliasilge/tidytext
Text mining using tidy tools :sparkles::page_facing_up::sparkles:
This tool helps data analysts and researchers prepare unstructured text data for analysis. It takes text documents (like books, articles, or social media posts) and converts them into a structured format where each word is on its own row, making it easy to count words, filter out common terms, and even analyze the sentiment of the text. Anyone working with large collections of text who needs to extract insights can use this.
1,200 stars.
Use this if you need to systematically analyze text data by breaking it down into individual words or short phrases to understand patterns, frequencies, or sentiment.
Not ideal if you are solely looking for pre-built, complex natural language processing models without needing to prepare the text data yourself.
Stars
1,200
Forks
183
Language
R
License
—
Category
Last pushed
Feb 21, 2026
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/nlp/juliasilge/tidytext"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Related tools
quanteda/quanteda
An R package for the Quantitative Analysis of Textual Data
massimoaria/tall
Text Analysis for aLL
keyATM/keyATM
An R package for Keyword Assisted Topic Models
gagolews/stringi
Fast and Portable Character String Processing in R (with the Unicode ICU)
ropensci/gutenbergr
Search, download, and process public domain texts from Project Gutenberg