achuthasubhash/Complete-Life-Cycle-of-a-Data-Science-Project
Complete-Life-Cycle-of-a-Data-Science-Project
This resource provides a comprehensive guide and toolkit for gathering data from diverse sources across the web. It shows you how to collect structured data like tables, semi-structured data like JSON, and unstructured data such as text and images from websites, social media, and other online platforms. It's designed for anyone who needs to compile large datasets for analysis, research, or building data-driven applications.
638 stars. No commits in the last 6 months.
Use this if you need to systematically collect various types of data from websites, social media (like Twitter, Instagram, LinkedIn, Reddit), e-commerce sites, or news articles to fuel your data projects.
Not ideal if you primarily work with pre-existing, clean datasets and do not require methods for extracting information directly from web sources.
Stars
638
Forks
253
Language
—
License
MIT
Category
Last pushed
Jun 07, 2024
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/ml-frameworks/achuthasubhash/Complete-Life-Cycle-of-a-Data-Science-Project"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Related frameworks
GoogleCloudPlatform/data-science-on-gcp
Source code accompanying book: Data Science on the Google Cloud Platform, Valliappa Lakshmanan,...
rjurney/Agile_Data_Code_2
Code for Agile Data Science 2.0, O'Reilly 2017, Second Edition
linogaliana/python-datascientist
Dépôt associé au cours Python pour data scientists (ENSAE 2e année)
yogeshhk/TeachingDataScience
Course notes for Data Science related topics, prepared in LaTeX
PacktWorkshops/The-Data-Science-Workshop
A New, Interactive Approach to Learning Data Science