fingeredman/teanaps-web-scraper
텍스트 분석용 데이터 수집을 위한 웹스크래핑 도구를 제공합니다.
This tool helps researchers and educators gather text data from various online sources for text analysis. It takes URLs from specific sites like movie review platforms, news sites, app stores, or Naver Cafes and outputs structured text data, such as reviews, articles, or posts. This is designed for academics and educators conducting research or teaching text mining.
No commits in the last 6 months.
Use this if you need to collect large volumes of text data from movie reviews, news articles, app store reviews, or Naver Cafe posts for academic text analysis.
Not ideal if you require data from commercial platforms not listed, or if your purpose is for commercial use rather than research or education.
Stars
8
Forks
1
Language
Jupyter Notebook
License
Apache-2.0
Category
Last pushed
Sep 18, 2022
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/perception/fingeredman/teanaps-web-scraper"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
seleniumbase/SeleniumBase
APIs for browser automation, testing, and bypassing bot-detection.
apify/crawlee-python
Crawlee—A web scraping and browser automation library for Python to build reliable crawlers....
intoli/user-agents
A JavaScript library for generating random user agents with data that's updated daily.
apify/crawlee
Crawlee—A web scraping and browser automation library for Node.js to build reliable crawlers. In...
Kaliiiiiiiiii-Vinyzu/patchright
Undetected version of the Playwright testing and automation library.