endymecy/spark-ml-source-analysis

An analysis of the principles behind Spark ML algorithms and of their concrete source code implementations

51 / 100
Established

This project offers detailed explanations and source code analysis for the machine learning algorithms implemented in Spark ML. It helps data scientists and machine learning engineers understand how these algorithms work and how their computation is distributed. Topics range from basic statistics and clustering to dimensionality reduction and feature engineering, and each is covered with both theoretical background and implementation detail.

1,962 stars. No commits in the last 6 months.

Use this if you are a data scientist or machine learning engineer working with Spark and want to understand the underlying principles and distributed implementations of Spark ML algorithms.

Not ideal if you are looking for a ready-to-use library or tool for data analysis without delving into the internal workings of Spark ML.

distributed-machine-learning algorithm-analysis big-data-analytics spark-development data-science-education
Stale 6m No Package No Dependents
Maintenance 0 / 25
Adoption 10 / 25
Maturity 16 / 25
Community 25 / 25


Stars: 1,962
Forks: 821
Language: (not listed)
License: Apache-2.0
Last pushed: Mar 25, 2019
Commits (30d): 0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/ml-frameworks/endymecy/spark-ml-source-analysis"

Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000/day.
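The curl call above can also be sketched in Python with only the standard library. This is a minimal sketch under stated assumptions: the helper names `quality_url` and `fetch_quality` are hypothetical, the `category/owner/repo` path pattern is inferred from the single URL shown above, and the response is assumed to be JSON, since its schema is not described here.

```python
import json
import urllib.request
from urllib.parse import quote

# Base path taken from the curl example above.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def quality_url(category: str, owner: str, repo: str) -> str:
    """Build the endpoint URL (hypothetical helper).

    Assumption: the path is <category>/<owner>/<repo>, inferred from the
    one URL shown in the curl example.
    """
    parts = [quote(part, safe="") for part in (category, owner, repo)]
    return BASE_URL + "/" + "/".join(parts)


def fetch_quality(category: str, owner: str, repo: str, timeout: float = 10.0):
    """Issue a keyless GET request and decode the body (hypothetical helper).

    Assumption: the endpoint returns JSON; the schema is not documented here,
    so the decoded object is returned as-is.
    """
    url = quality_url(category, owner, repo)
    with urllib.request.urlopen(url, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))
```

Calling `fetch_quality("ml-frameworks", "endymecy", "spark-ml-source-analysis")` would issue the same request as the curl command. Each call counts against the daily request limit, so a script that polls repeatedly may want to cache the result.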