yuvalpinter/nytwit
New York Times Word Innovation Types dataset
This dataset provides a categorized list of new and unusual words that have appeared in The New York Times, along with their innovation types like blends or compounds. It's designed for linguists, lexicographers, and computational linguists studying how new words enter and evolve in language, offering a resource for analyzing linguistic trends over time.
No commits in the last 6 months.
Use this if you are a linguist, lexicographer, or language researcher analyzing word innovation and neologisms in journalistic text.
Not ideal if you need a general dictionary or thesaurus, or are looking for real-time tracking of newly coined words across a wide range of sources.
Stars
21
Forks
5
Language
—
License
GPL-3.0
Category
Last pushed
Dec 01, 2020
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/nlp/yuvalpinter/nytwit"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
ryanjgallagher/shifterator
Interpretable data visualizations for understanding how texts differ at the word level
HLasse/TextDescriptives
A Python library for calculating a large variety of metrics from text
jboynyc/textnets
Text analysis with networks.
DemetersSon83/Quantitative-Discursive-Analysis
A tool for quantitatively measuring discursive similarity between bodies of text.
sciknoworg/tib-sid
TIB-SID: A bilingual (English/German) dataset of library catalog records with GND subject...