iamlxb3/TextDatasetAnalyzer
This is a simple tool for text dataset analysis and multiple datasets comparison. Keywords: corpus, text dataset, text distribution, part-of-speech(pos), zipf's-law, distinct value, concreteness
No commits in the last 6 months.
Stars
3
Forks
1
Language
Jupyter Notebook
License
MIT
Category
Last pushed
Apr 11, 2022
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/nlp/iamlxb3/TextDatasetAnalyzer"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
ryanjgallagher/shifterator
Interpretable data visualizations for understanding how texts differ at the word level
HLasse/TextDescriptives
A Python library for calculating a large variety of metrics from text
jboynyc/textnets
Text analysis with networks.
DemetersSon83/Quantitative-Discursive-Analysis
A tool for quantitatively measuring discursive similarity between bodies of text.
sciknoworg/tib-sid
TIB-SID: A bilingual (English/German) dataset of library catalog records with GND subject...