Text Analysis Frameworks NLP Tools

R packages and frameworks for general-purpose textual data analysis, mining, and quantitative text processing. Does NOT include task-specific applications (sentiment analysis, NER, topic modeling), API wrappers, or language-specific toolkits.

There are 86 text analysis frameworks tools tracked. 1 score above 70 (verified tier). The highest-rated is quanteda/quanteda at 74/100 with 877 stars. 1 of the top 10 are actively maintained.

Get all 86 projects as JSON

curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=nlp&subcategory=text-analysis-frameworks&limit=20"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.

# Tool Score Tier
1 quanteda/quanteda

An R package for the Quantitative Analysis of Textual Data

74
Verified
2 juliasilge/tidytext

Text mining using tidy tools :sparkles::page_facing_up::sparkles:

59
Established
3 massimoaria/tall

Text Analysis for aLL

53
Established
4 keyATM/keyATM

An R package for Keyword Assisted Topic Models

53
Established
5 gagolews/stringi

Fast and Portable Character String Processing in R (with the Unicode ICU)

51
Established
6 ropensci/gutenbergr

Search, download, and process public domain texts from Project Gutenberg

50
Established
7 lll-lll-lll-lll/sent-pattern

sent-pattern package categorizes English sentences into one of five basic...

48
Emerging
8 irudnyts/openai

An R package-wrapper around OpenAI API

47
Emerging
9 bnosac/BTM

Biterm Topic Modelling for Short Text with R

47
Emerging
10 Reilly-ConceptsCognitionLab/ConversationAlign

An R package for analyzing linguistic alignment between partners in...

45
Emerging
11 WZBSocialScienceCenter/tmtoolkit

Text Mining and Topic Modeling Toolkit for Python with parallel processing power

43
Emerging
12 bnosac/pattern.nlp

R package to perform sentiment analysis and Parts of Speech tagging for...

42
Emerging
13 harmonydata/harmonyapi

This is the source code for the Harmony project REST API

42
Emerging
14 SajjadPourali/Surnames

Surnames dispersion around the world which sorted by population

41
Emerging
15 ropensci-review-tools/pkgmatch

Find R packages matching either descriptions or other R packages

41
Emerging
16 cschwem2er/stminsights

A Shiny Application for Inspecting Structural Topic Models

40
Emerging
17 khushmeeet/potter-nlp

Data Analysis of harry potter text

39
Emerging
18 harmonydata/harmony_r

R library for Harmony. R package - open source tool using AI for psychology...

39
Emerging
19 moj-analytical-services/pq-tool

Tool to analyse past parliamentary questions with visualisation in RShiny

38
Emerging
20 storieswithsiva/Text-Mining-in-R

📷 Generates Text Analytics using Bag of Words. Upload your data and it will...

37
Emerging
21 chris31415926535/tardis

tardis: Text Analysis with Rules and Dictionaries for Inferring Sentiment

36
Emerging
22 juba/rainette

R implementation of the Reinert text clustering method

36
Emerging
23 quanteda/spacyr

R wrapper to spaCy NLP

36
Emerging
24 jaeyk/tidyethnicnews

R package for turning Ethnic NewsWatch search results into tidyverse-ready dataframes

36
Emerging
25 news-r/nltk4r

Natural Language Toolkit for R

35
Emerging
26 nhsx/stm-survey-text

Structural Topic modelling Code to gain insights from free text responses to...

35
Emerging
27 agusnieto77/TalleR

Taller sobre R para el monitoreo y análisis de la conflictividad social....

35
Emerging
28 news-r/gensimr

📝 Topic Modeling for Humans

34
Emerging
29 ropensci-archive/monkeylearn

:no_entry: ARCHIVED :no_entry: Accesses the Monkeylearn API for Text...

34
Emerging
30 ben-aaron188/snlp

2-day course on Statistical Natural Language Processing in R (foundational level)

33
Emerging
31 Chris1221/goldi

R Package for Gene Ontology Label Discernment and Identification.

33
Emerging
32 mbodke/Twitter-Sentiment-Analysis-using-R-shiny

Project based on text mining:

32
Emerging
33 ropensci-archive/geoparser

:no_entry: ARCHIVED :no_entry:

32
Emerging
34 jaytimm/textpress

A rugged, practical R toolkit for web scraping, stepwise NLP, and...

32
Emerging
35 ErikaJacobs/Harry-Potter-Text-Mining

NLP text analysis of the Harry Potter book series

31
Emerging
36 EmilHvitfeldt/R-text-data

List of textual data sources to be used for text mining in R

31
Emerging
37 ratsgo/models

NLP models

31
Emerging
38 trajceskijovan/Structural-Topic-Modeling-in-R

Structural Topic Modeling in R (published two articles on Medium). STM, LDA,...

30
Emerging
39 evanm31/twittR

R Shiny app for tweet analysis

30
Emerging
40 rivaquiroga/analisis-de-textos-r-2023

Materiales del módulo sobre análisis de texto con R del Diplomado en...

30
Emerging
41 abuchmueller/Twitmo

Collect Twitter data and create topic models with R

30
Emerging
42 brandonleekramer/tidyorgs

A tidy package that detects and standardizes organizations in unstructured text data

29
Experimental
43 Tuanpham1994/Text-visualization-uncovering-structure-and-sentiment-analysis

School assignment for text analytics

29
Experimental
44 news-r/decipher

📇 Simple NLP in R

29
Experimental
45 drdataSpp/nlp_r

This repository contains the NLP and Text Mining R script and the generated...

29
Experimental
46 news-r/textanalysis

Text Analysis in R

29
Experimental
47 BrianWeinstein/googlenlp

An Interface to Google's Cloud Natural Language API

29
Experimental
48 CityofEdmonton/text_depot

Search and analyze your text data

28
Experimental
49 gesiscss/ptm

Introduction to Natural Language Processing with a special emphasis on the...

28
Experimental
50 uribo/textlintr

✅ Natural Language Linter Tools for 'R Markdown' and R Code 💯

28
Experimental
51 news-r/textblob

📜 TextBlob for R

28
Experimental
52 cran-task-views/NaturalLanguageProcessing

CRAN Task View: Natural Language Processing

27
Experimental
53 mkearney/googleapis

R client for accessing Google Cloud Natural Language APIs

26
Experimental
54 leedongwei/data-scooping

Flyweight data mining with R

25
Experimental
55 ropenscilabs/tif

Text Interchange Formats

25
Experimental
56 PythonCoderUnicorn/HarryPotterBooks

Harry Potter books for Text Analysis

24
Experimental
57 doug-friedman/topicdoc

Topic-Specific Diagnostics for LDA and CTM Topic Models

23
Experimental
58 gagolews/stringx

Drop-in replacements for base R string functions powered by stringi

23
Experimental
59 gesiscss/polmineR-examples

A binderized environment for polmineR and the Corpus Workbench (CWB) as backend

22
Experimental
60 samuelae/associatoR

an R package to work with association data

22
Experimental
61 mkearney/dict

Word-Based Dictionaries for Natural Language

21
Experimental
62 corydonbaylor/getwiki

getwiki provides an R wrapper for wikipedia's API, allowing users to easily...

20
Experimental
63 tyiannak/pyScholar

Python Library to Analyse and Visualise Google Scholar Metadata

20
Experimental
64 news-r/stanfordnlp

🎓Stanford NLP in R

20
Experimental
65 stevecondylios/dictionaRy

An R interface to the English Dictionary

20
Experimental
66 systats/textlearnR

A simple collection of well working NLP models (Keras, H2O, StarSpace) tuned...

19
Experimental
67 AdrianRasoOnGit/langstats

langstats is an R package designed to help language researchers explore the...

18
Experimental
68 pachadotdev/freedomhouse

The goal of freedom is to ease the usage of the Freedom in the World dataset...

18
Experimental
69 Diverking/HappyDB

Resources generated for the analysis of the HappyDB corpus

17
Experimental
70 retrofit-lab/ashrae-1836-rp-text-mining

Data and code for the paper "What we talk about when we talk about EEMs:...

17
Experimental
71 peterdalle/svensktext-rpackage

svensktext R package for the svensktext data repository

17
Experimental
72 jengzang/villages-ML

Statistical and NLP analysis of 285K+ natural village toponyms in Guangdong...

16
Experimental
73 rubato103/textmining_KCI_RISS

R을 이용한 한국어 학술 논문 데이터(KCI, RISS) 대상의 형태소 분석, N그램 추출, 토픽 모델링 통합 파이프라인

15
Experimental
74 miserman/lingmatch

An all-in-one R package for the assessment of linguistic similarity

15
Experimental
75 profservin/datalab

Serie de ejercicios para familiarizarse con R

14
Experimental
76 filmicaesthetic/JaneAustenStreamgraphs

Emotion analysis and visualisation of Jane Austen novels in R.

13
Experimental
77 seiichiinoue/cstm

implementation of continuous space topic model for modeling text through...

12
Experimental
78 bit2r/bitTomato

뉴스토마토 뉴스기사 데이터 및 관련 컨텐츠

11
Experimental
79 edvardoss/abbrevTexts

Functions that will make life less sad when working with abbreviated text...

11
Experimental
80 FRBCesab/nlpdevi

:basket: Toolbox for Natural Language Processing

11
Experimental
81 kivanc57/quita_mini

Quita Mini is a text analysis tool designed to calculate various linguistic...

11
Experimental
82 Yusuf-Abol/Alumni-Interaction-and-Conversation-Dynamics-NLP

This Natural Language Processing (NLP) project took a dive into chat...

11
Experimental
83 SUPSI-DEASS-CARES/multilingual_thematic_analysis

Modular R pipeline for multilingual survey analysis — translate, embed,...

11
Experimental
84 sabamadadi/Harry-Potter-Books-NLP-and-Classification

Automatic classification of Harry Potter characters into Hogwarts houses...

10
Experimental
85 Jieyi-Chen-98/Chicago-Housing-Price

This project analyzes zipcode-year housing prices in Chicago area by text...

10
Experimental
86 felixhoffmnn/gepart

GePart: German Party Classification Model. Student project during the 5th...

10
Experimental