affjljoo3581/canrevan
대량의 네이버 뉴스 기사를 수집하는 라이브러리입니다.
This tool helps researchers and data scientists quickly gather large volumes of Korean news articles from Naver News. You input specific categories and date ranges, and it outputs a dataset of news article text. It's designed for anyone building datasets for natural language processing (NLP) tasks, especially in Korean.
No commits in the last 6 months. Available on PyPI.
Use this if you need to build a substantial collection of high-quality Korean news articles for linguistic analysis, model training, or research.
Not ideal if you need news data from sources other than Naver, or if your primary interest is in structured metadata rather than raw article text.
Stars
97
Forks
19
Language
Python
License
Apache-2.0
Category
Last pushed
Feb 03, 2023
Commits (30d)
0
Dependencies
4
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/nlp/affjljoo3581/canrevan"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Related tools
flairNLP/fundus
A very simple news crawler with a funny name
fhamborg/news-please
news-please - an integrated web crawler and information extractor for news that just works
FreeDiscovery/FreeDiscovery
Web Service for E-Discovery Analytics
tirthajyoti/Web-Database-Analytics
Web scrapping and related analytics using Python tools
Multiverse-of-Projects/NewsAI
A dynamic NewsAI dashboard that uses NLP to analyze news articles, visualize sentiment trends,...