nuhmanpk/Webtrench
A powerful and easy-to-use web scrapper for collecting data from the web. Supports scraping of images, text, videos, meta data, and more. Ideal for machine learning and deep learning engineers. Download and extract data with just one line of code
This tool helps machine learning and deep learning engineers gather diverse data from websites, including text, images, videos, and metadata. You input a website URL, and the tool outputs the extracted data saved to your specified local folders. It streamlines the data collection process for training models or conducting research.
No commits in the last 6 months. Available on PyPI.
Use this if you need to quickly and easily collect various types of data from public websites for your machine learning projects or data analysis.
Not ideal if the website's structure changes frequently or if you need to scrape sites with strict legal or ethical restrictions against automated data collection.
Stars
26
Forks
6
Language
Python
License
MIT
Category
Last pushed
Nov 19, 2023
Commits (30d)
0
Dependencies
2
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/ml-frameworks/nuhmanpk/Webtrench"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
alirezamika/autoscraper
A Smart, Automatic, Fast and Lightweight Web Scraper for Python
YoongiKim/AutoCrawler
Google, Naver multiprocess image web crawler (Selenium)
machine-learning-apps/Issue-Label-Bot
Code For The Issue Label Bot, an App that automatically labels issues using machine learning,...
lorey/mlscraper
🤖 Scrape data from HTML websites automatically by just providing examples
shaohua0116/ICLR2020-OpenReviewData
Script that crawls meta data from ICLR OpenReview webpage. Tutorials on installing and using...