tblock/10kGNAD
Ten Thousand German News Articles Dataset for Topic Classification
This project provides a dataset of roughly ten thousand German news articles prepared for topic classification. It takes a database of German news as input and produces a structured dataset of articles, ready for analysis or for training classification models. It is aimed at researchers, data scientists, and anyone working with large volumes of German-language media.
No commits in the last 6 months.
Use this if you need a pre-processed dataset of German news articles for topic classification tasks or for training machine learning models.
Not ideal if you are looking for an active news scraper or a real-time news categorization tool.
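As a rough illustration of how a pre-processed, labeled article file might be consumed for a classification task, here is a minimal sketch. It assumes one article per row in the form `label;'text'` (semicolon delimiter, single-quote quoting); that layout, the helper name `read_labeled_articles`, and the sample rows are assumptions made for illustration, so check the repository's own files before relying on them.

```python
import csv
import io
from collections import Counter


def read_labeled_articles(handle, delimiter=";", quotechar="'"):
    """Yield (label, text) pairs from a delimited file, one article per row.

    The delimiter and quote character are assumptions about the file layout.
    """
    reader = csv.reader(handle, delimiter=delimiter, quotechar=quotechar)
    for row in reader:
        if len(row) < 2:
            continue  # skip blank or malformed rows
        yield row[0], row[1]


# Made-up sample rows standing in for a real file on disk.
SAMPLE = (
    "Sport;'Der Verein gewann das Spiel; die Fans feierten.'\n"
    "Wirtschaft;'Die Inflation sinkt leicht.'\n"
    "Sport;'Neuer Trainer vorgestellt.'\n"
)

pairs = list(read_labeled_articles(io.StringIO(SAMPLE)))

# Class distribution, useful for spotting imbalance before training.
label_counts = Counter(label for label, _ in pairs)
```

With a real file, the `io.StringIO(SAMPLE)` argument would be replaced by `open(path, encoding="utf-8", newline="")`. The resulting pairs can be fed to any text classifier.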
Stars: 87
Forks: 15
Language: Python
License: MIT
Category:
Last pushed: Nov 07, 2022
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/nlp/tblock/10kGNAD"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
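The same request can be issued from Python. The sketch below uses only the standard library and makes two assumptions: that the path segments after `quality/` are a category, a repository owner, and a repository name, and that the response body is JSON. The function names `quality_url` and `fetch_quality` are hypothetical, and the response schema is not documented here, so the result is returned as-is.

```python
import json
import urllib.request
from urllib.parse import quote

BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def quality_url(category: str, owner: str, repo: str) -> str:
    """Build the endpoint URL for one repository (path layout is assumed)."""
    parts = [quote(part, safe="") for part in (category, owner, repo)]
    return BASE_URL + "/" + "/".join(parts)


def fetch_quality(category: str, owner: str, repo: str, timeout: float = 10.0):
    """GET the endpoint and decode the body, assuming it is JSON."""
    request = urllib.request.Request(
        quality_url(category, owner, repo),
        headers={"Accept": "application/json"},
    )
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))
```

Calling `fetch_quality("nlp", "tblock", "10kGNAD")` would request the same URL as the curl command above. Given the stated limit of 100 requests per day without a key, caching the result locally is a sensible precaution.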
Higher-rated alternatives
angelosalatino/cso-classifier
Python library that classifies content from scientific papers with the topics of the Computer...
newsgac/platform
Platform for machine learning experiments developed in the project NEWSGAC
giuseppebonaccorso/Reuters-21578-Classification
Text classification with Reuters-21578 datasets using Gensim Word2Vec and Keras LSTM
aqibsaeed/Research-Paper-Categorization
Research paper classification using machine learning and NLP
timthedev07/news-articles-classification
Classification model for news articles with deep learning.