peterdalle/svensktext
Swedish language resources: female and male names, localities, counties, municipalities, countries, nationalities, occupations, sentiment lexicon, morality, stop words, government agencies, and more.
This project collects Swedish language data for anyone working with Swedish text. It covers personal names, places, and occupations as well as sentiment lexicons and morality-related word lists. Researchers, data analysts, and content creators can use the structured data in their own projects.
No commits in the last 6 months.
Use this if you need pre-compiled, categorized, and easily machine-readable datasets for Swedish names, locations, occupations, sentiment, or other linguistic features.
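As a rough illustration of how a word list like this might be used, here is a minimal Python sketch that loads a stop word list and filters a sentence with it. The one-entry-per-line layout, the sample entries, and the helper names load_word_list and remove_stop_words are assumptions made for the example; the actual file names and formats in the repository are not described here and may differ.

```python
import io


def load_word_list(lines):
    """Build a set of lowercase entries from an iterable of text lines.

    Assumes one entry per line; blank lines are skipped.
    """
    return {line.strip().lower() for line in lines if line.strip()}


def remove_stop_words(text, stop_words):
    """Split text on whitespace and drop tokens found in stop_words."""
    return [token for token in text.lower().split() if token not in stop_words]


# Stand-in for a real file (hypothetical content). With a downloaded file you
# would use open(path, encoding="utf-8") instead of io.StringIO.
sample_file = io.StringIO("och\natt\ndet\n")

stop_words = load_word_list(sample_file)
tokens = remove_stop_words("Det är viktigt att läsa och skriva", stop_words)
print(tokens)
```

Using a set keeps the membership check fast even for long lists. Whitespace splitting is a simplification; real text would also need punctuation handling.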
Not ideal if you're looking for a natural language processing model or a tool to perform text analysis directly, as this provides raw data resources rather than analytical functionality.
Stars: 82
Forks: 7
Language: Python
License: none listed
Category: none listed
Last pushed: Jun 03, 2023
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/nlp/peterdalle/svensktext"
Open to everyone: 100 requests/day, no key needed. Get a free key for 1,000/day.
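The same request can be made from Python using only the standard library. This is a sketch under assumptions: the URL pattern (category, owner, repository) is generalized from the single curl example above, the response is assumed to be JSON, and the function names build_quality_url and fetch_quality are made up for illustration.

```python
import json
import urllib.request

# Base path taken from the curl example; the path pattern after it is an
# assumption generalized from that single URL.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def build_quality_url(category, owner, repo):
    """Assemble the endpoint URL for one repository."""
    return f"{BASE_URL}/{category}/{owner}/{repo}"


def fetch_quality(category, owner, repo, timeout=10.0):
    """Fetch the endpoint and decode the body, assuming it is JSON."""
    url = build_quality_url(category, owner, repo)
    with urllib.request.urlopen(url, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))


url = build_quality_url("nlp", "peterdalle", "svensktext")
print(url)
# To perform the actual request (counts against the daily limit):
# data = fetch_quality("nlp", "peterdalle", "svensktext")
```

The network call is left commented out so the sketch can be run without spending any of the 100 daily requests.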
Higher-rated alternatives
sloria/TextBlob
Simple, Pythonic, text processing--Sentiment analysis, part-of-speech tagging, noun phrase...
chrismattmann/tika-python
Tika-Python is a Python binding to the Apache Tika™ REST services allowing Tika to be called...
cltk/cltk
The Classical Language Toolkit
allenai/scispacy
A full spaCy pipeline and models for scientific/biomedical documents.
wi2trier/cbrkit
Customizable Case-Based Reasoning (CBR) toolkit for Python with a built-in API and CLI.