juliasilge/tidytext

Text mining using tidy tools ✨📄✨

/ 100

Established

This tool helps data analysts and researchers prepare unstructured text data for analysis. It takes text documents (like books, articles, or social media posts) and converts them into a structured format where each word is on its own row, making it easy to count words, filter out common terms, and even analyze the sentiment of the text. Anyone working with large collections of text who needs to extract insights can use this.
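The one-word-per-row layout described above can be illustrated with a small sketch. This is plain Python written only to show the idea, not tidytext's own API (tidytext is an R package). The sample documents, the tiny stop-word set, and the two-word sentiment lexicon are all made-up assumptions; real lexicons are far larger.

```python
import re
from collections import Counter

# Hypothetical sample documents (assumption, for illustration only).
DOCS = {
    "doc1": "The cat sat on the mat. The cat was happy.",
    "doc2": "A sad dog sat in the rain.",
}

# Tiny illustrative word lists (assumptions); real lexicons are much larger.
STOP_WORDS = {"the", "a", "on", "in", "was"}
SENTIMENT = {"happy": 1, "sad": -1}


def tokenize(docs):
    """Return one (doc_id, word) row per token, lowercased."""
    rows = []
    for doc_id, text in docs.items():
        for word in re.findall(r"[a-z']+", text.lower()):
            rows.append((doc_id, word))
    return rows


# Step 1: one row per word.
rows = tokenize(DOCS)

# Step 2: filter out common terms.
content_rows = [(doc_id, word) for doc_id, word in rows if word not in STOP_WORDS]

# Step 3: count the remaining words across all documents.
word_counts = Counter(word for _, word in content_rows)

# Step 4: sum per-word sentiment scores for each document.
sentiment_by_doc = {}
for doc_id, word in content_rows:
    sentiment_by_doc[doc_id] = sentiment_by_doc.get(doc_id, 0) + SENTIMENT.get(word, 0)

print(word_counts.most_common(2))
print(sentiment_by_doc)
```

Because every row holds a single word and its document, counting, filtering, and scoring each reduce to a simple pass over the same table, which is the main appeal of the format.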

1,200 stars.

Use this if you need to systematically analyze text data by breaking it down into individual words or short phrases to understand patterns, frequencies, or sentiment.

Not ideal if you are solely looking for pre-built, complex natural language processing models without needing to prepare the text data yourself.

text-analysis data-preparation linguistic-analysis sentiment-analysis market-research

No Package · No Dependents

Maintenance 10 / 25

Adoption 10 / 25

Maturity 16 / 25

Community 23 / 25


Stars

1,200

Forks

183

Language

License

—

Related tools

quanteda/quanteda

An R package for the Quantitative Analysis of Textual Data

massimoaria/tall

Text Analysis for aLL

keyATM/keyATM

An R package for Keyword Assisted Topic Models

gagolews/stringi

Fast and Portable Character String Processing in R (with the Unicode ICU)

ropensci/gutenbergr

Search, download, and process public domain texts from Project Gutenberg
