jonathandinu/spark-ray-data-science
Supporting content (slides and exercises) for the Pearson video series covering best practices for developing scalable applications with Spark and Ray in the context of a data scientist's standard workflow.
This project provides practical, hands-on lessons to help data scientists or AI practitioners scale their machine learning projects. It teaches how to use Python, Spark, and Ray to handle large datasets and complex computations. You'll learn to take raw data and turn it into scalable insights and deployed AI models.
No commits in the last 6 months.
Use this if you are a data scientist or AI practitioner who needs to process large datasets and scale machine learning or artificial intelligence applications using Python.
Not ideal if you are looking for an academic overview of distributed systems theory without practical application, or if you are not familiar with basic Python programming.
Stars
53
Forks
5
Language
Jupyter Notebook
License
—
Category
Last pushed
Jan 16, 2025
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/ml-frameworks/jonathandinu/spark-ray-data-science"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
qualcomm/ai-hub-models
Qualcomm® AI Hub Models is our collection of state-of-the-art machine learning models optimized...
petuum/adaptdl
Resource-adaptive cluster scheduler for deep learning training.
zszazi/Deep-learning-in-cloud
List of Deep Learning Cloud Providers
lincc-frameworks/hyrax
Hyrax - A low-code framework for rapid experimentation with ML & unsupervised discovery in astronomy
openhackathons-org/gpubootcamp
This repository consists for gpu bootcamp material for HPC and AI