jonathandinu/spark-ray-data-science

Supporting content (slides and exercises) for the Pearson video series covering best practices for developing scalable applications with Spark and Ray in the context of a data scientist's standard workflow.

26
/ 100
Experimental

This project provides practical, hands-on lessons to help data scientists or AI practitioners scale their machine learning projects. It teaches how to use Python, Spark, and Ray to handle large datasets and complex computations. You'll learn to take raw data and turn it into scalable insights and deployed AI models.

No commits in the last 6 months.

Use this if you are a data scientist or AI practitioner who needs to process large datasets and scale machine learning or artificial intelligence applications using Python.

Not ideal if you are looking for an academic overview of distributed systems theory without practical application, or if you are not familiar with basic Python programming.

scalable-data-science machine-learning-engineering distributed-computing AI-application-deployment big-data-analytics
No License Stale 6m No Package No Dependents
Maintenance 0 / 25
Adoption 8 / 25
Maturity 8 / 25
Community 10 / 25

How are scores calculated?

Stars

53

Forks

5

Language

Jupyter Notebook

License

Last pushed

Jan 16, 2025

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/ml-frameworks/jonathandinu/spark-ray-data-science"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.