adbar/German-NLP
Curated list of open-access/open-source/off-the-shelf resources and tools developed with a particular focus on German
This is a curated collection of resources and tools specifically for working with the German language. It provides access to various types of German text data (corpora) and off-the-shelf software for processing and understanding German text. Researchers, linguists, data scientists, and anyone analyzing or building applications for German text will find this useful.
518 stars. No commits in the last 6 months.
Use this if you need to find existing German text datasets, or discover tools for tasks like sentiment analysis, named entity recognition, or translation specific to the German language.
Not ideal if you are looking for general-purpose NLP tools that aren't focused on German, or if you need to build a system from scratch without leveraging existing resources.
Stars
518
Forks
66
Language
—
License
—
Category
Last pushed
Oct 30, 2024
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/nlp/adbar/German-NLP"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
Helsinki-NLP/OpusFilter
OpusFilter - Parallel corpus processing toolkit
natasha/corus
Links to Russian corpora + Python functions for loading and parsing
SergeyShk/ruTS
Библиотека для извлечения статистик из текстов на русском языке.
darija-open-dataset/dataset
darija <-> english dataset
omicsNLP/Auto-CORPus
Auto-CORPus pipeline developed by a University of Nottingham and Imperial College London...