affjljoo3581/canrevan

A library for collecting large volumes of Naver News articles.

Score: 53 / 100 (Established)

This tool helps researchers and data scientists quickly gather large volumes of Korean news articles from Naver News. You input specific categories and date ranges, and it outputs a dataset of news article text. It's designed for anyone building datasets for natural language processing (NLP) tasks, especially in Korean.

No commits in the last 6 months. Available on PyPI.

Use this if you need to build a substantial collection of high-quality Korean news articles for linguistic analysis, model training, or research.

Not ideal if you need news data from sources other than Naver, or if your primary interest is in structured metadata rather than raw article text.

Korean NLP, News Data Collection, Research Data, Text Mining, Linguistic Analysis
Stale 6m
Maintenance 0 / 25
Adoption 9 / 25
Maturity 25 / 25
Community 19 / 25


Stars: 97
Forks: 19
Language: Python
License: Apache-2.0
Last pushed: Feb 03, 2023
Commits (30d): 0
Dependencies: 4

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/nlp/affjljoo3581/canrevan"

The API is open to everyone: 100 requests/day without a key, or 1,000/day with a free key.
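
For scripted use, the same request can be made from Python. The following is a minimal sketch using only the standard library. It assumes the endpoint returns a JSON body and that the path follows a category/owner/repo pattern, which is inferred from the single URL shown above and not confirmed by this page. The names `build_quality_url` and `fetch_quality` are hypothetical helpers, not part of any published client.

```python
import json
import urllib.request
from urllib.parse import quote

# Base path taken from the curl example on this page.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def build_quality_url(category: str, owner: str, repo: str) -> str:
    """Build the quality endpoint URL for one repository.

    Assumption: the path layout is <category>/<owner>/<repo>, inferred
    from the one example URL; other layouts may exist.
    """
    parts = (category, owner, repo)
    # Percent-encode each segment so unusual characters stay inside it.
    return BASE_URL + "/" + "/".join(quote(p, safe="") for p in parts)


def fetch_quality(category: str, owner: str, repo: str, timeout: float = 10.0):
    """Fetch and decode the quality record.

    Assumption: the response body is JSON. The response fields are not
    documented on this page, so the decoded object is returned as-is.
    """
    url = build_quality_url(category, owner, repo)
    with urllib.request.urlopen(url, timeout=timeout) as resp:
        return json.loads(resp.read().decode("utf-8"))
```

Calling `fetch_quality("nlp", "affjljoo3581", "canrevan")` requests the same URL as the curl command. Without a key, the 100 requests/day limit applies, so batch jobs may want to cache results locally.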