karen-pal/borges
Datasets of short-story texts by various Latin American authors. Benchmark datasets of several Spanish-language sentiment analysis libraries, run on a corpus of Borges.
This project provides organized collections of short stories by various Latin American authors, along with sentiment analysis data for Jorge Luis Borges's works. It gives literary scholars and researchers direct access to text data from influential Spanish-language literature, either as full-text corpora or as sentiment-annotated sentences.
No commits in the last 6 months.
Use this if you are a literary scholar, student, or researcher studying Latin American literature and need structured text datasets for quantitative analysis or close reading.
Not ideal if you are looking for general Spanish language datasets not focused on literary works or if your primary interest is in modern conversational Spanish.
Stars: 16
Forks: 2
Language: Jupyter Notebook
License: —
Category:
Last pushed: Sep 08, 2024
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/nlp/karen-pal/borges"
Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000 requests/day.
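The same request can be sketched in Python using only the standard library. This is a minimal sketch, not the service's official client: the helper names `build_quality_url` and `fetch_quality` are hypothetical, the `category/owner/repo` path layout is inferred from the single curl example above, and the assumption that the endpoint returns JSON is not confirmed by this page.

```python
import json
import urllib.request
from urllib.parse import quote

# Base path taken from the curl example above.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def build_quality_url(category: str, owner: str, repo: str) -> str:
    """Build an endpoint URL shaped like the curl example.

    The category/owner/repo layout is inferred from one example URL
    and may not hold for every entry (assumption).
    """
    parts = [quote(part, safe="") for part in (category, owner, repo)]
    return f"{BASE_URL}/{'/'.join(parts)}"


def fetch_quality(category: str, owner: str, repo: str, timeout: float = 10.0):
    """Fetch the record for one repository.

    Assumes the response body is JSON; adjust the parsing if it is not.
    """
    url = build_quality_url(category, owner, repo)
    with urllib.request.urlopen(url, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))


if __name__ == "__main__":
    # Prints the URL only; call fetch_quality(...) to make the request.
    print(build_quality_url("nlp", "karen-pal", "borges"))
```

Calling `fetch_quality("nlp", "karen-pal", "borges")` performs the network request, so it counts against the daily request limit stated above.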
Higher-rated alternatives
Helsinki-NLP/OpusFilter
OpusFilter - Parallel corpus processing toolkit
natasha/corus
Links to Russian corpora + Python functions for loading and parsing
darija-open-dataset/dataset
darija <-> english dataset
omicsNLP/Auto-CORPus
Auto-CORPus pipeline developed by a University of Nottingham and Imperial College London...
SergeyShk/ruTS
Library for extracting statistics from Russian-language texts.