JasonKessler/Scattertext-PyData

Notebooks for the Seattle PyData 2017 talk on Scattertext

40
/ 100
Emerging

This project helps data scientists, researchers, and content strategists analyze and visualize how language differs between categories of documents. You input collections of text documents (e.g., political speeches, product reviews, scientific papers) grouped by categories like political party, gender, or topic. It outputs interactive scatter plots that highlight the words and phrases most characteristic of each category, making it easy to spot linguistic distinctions.

141 stars. No commits in the last 6 months.

Use this if you need to quickly identify and visualize the unique language patterns present in different groups of text documents.

Not ideal if you're looking for advanced sentiment analysis or complex topic modeling beyond comparing word usage frequencies.

text-analysis linguistic-comparison document-categorization content-strategy research-data-analysis
No License Stale 6m No Package No Dependents
Maintenance 0 / 25
Adoption 10 / 25
Maturity 8 / 25
Community 22 / 25

How are scores calculated?

Stars

141

Forks

52

Language

HTML

License

Last pushed

Jan 12, 2018

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/nlp/JasonKessler/Scattertext-PyData"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.