nuhmanpk/Webtrench

A powerful and easy-to-use web scrapper for collecting data from the web. Supports scraping of images, text, videos, meta data, and more. Ideal for machine learning and deep learning engineers. Download and extract data with just one line of code

48
/ 100
Emerging

This tool helps machine learning and deep learning engineers gather diverse data from websites, including text, images, videos, and metadata. You input a website URL, and the tool outputs the extracted data saved to your specified local folders. It streamlines the data collection process for training models or conducting research.

No commits in the last 6 months. Available on PyPI.

Use this if you need to quickly and easily collect various types of data from public websites for your machine learning projects or data analysis.

Not ideal if the website's structure changes frequently or if you need to scrape sites with strict legal or ethical restrictions against automated data collection.

data-collection web-scraping machine-learning-data research-data-gathering
Stale 6m
Maintenance 0 / 25
Adoption 7 / 25
Maturity 25 / 25
Community 16 / 25

How are scores calculated?

Stars

26

Forks

6

Language

Python

License

MIT

Last pushed

Nov 19, 2023

Commits (30d)

0

Dependencies

2

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/ml-frameworks/nuhmanpk/Webtrench"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.