texttechnologylab/GerParCor
German Parliamentary Corpus (GerParCor)
GerParCor provides a comprehensive collection of digitized parliamentary protocols from Germany, Austria, Switzerland, and Liechtenstein, spanning three centuries up to 1797. It takes raw, often previously unavailable, historical legislative texts and offers them in a unified, preprocessed format for analysis. This resource is ideal for historians, political scientists, linguists, and researchers studying German-speaking parliamentary discourse over time.
Use this if you need access to a large, standardized dataset of historical and contemporary German-language parliamentary records for research and analysis.
Not ideal if your research focuses on non-Germanic languages or requires only a small, specific selection of recent parliamentary documents.
Stars
30
Forks
10
Language
Java
License
AGPL-3.0
Category
Last pushed
Jan 14, 2026
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/nlp/texttechnologylab/GerParCor"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Related tools
Helsinki-NLP/OpusFilter
OpusFilter - Parallel corpus processing toolkit
natasha/corus
Links to Russian corpora + Python functions for loading and parsing
darija-open-dataset/dataset
darija <-> english dataset
omicsNLP/Auto-CORPus
Auto-CORPus pipeline developed by a University of Nottingham and Imperial College London...
SergeyShk/ruTS
Библиотека для извлечения статистик из текстов на русском языке.