ShifuML/guagua

An iterative computing framework for both Hadoop MapReduce and Hadoop YARN.

/ 100

Emerging

This framework helps data scientists and machine learning engineers train complex machine learning models faster. It takes large datasets and a model definition as input, and outputs a trained model. It's designed for professionals working with big data and distributed computing platforms like Hadoop.

No commits in the last 6 months.

Use this if you need to significantly speed up the training of large-scale machine learning models, especially neural networks, on Hadoop.

Not ideal if you are working with small datasets or prefer a single-machine training environment.

distributed-machine-learning model-training big-data-analytics neural-networks hadoop-ecosystem

Stale 6m No Package No Dependents

Maintenance 0 / 25

Adoption 9 / 25

Maturity 16 / 25

Community 21 / 25

How are scores calculated?

Stars

Forks

Language

Java

License

Apache-2.0

Higher-rated alternatives

lensacom/sparkit-learn

PySpark + Scikit-learn = Sparkit-learn

Angel-ML/angel

A Flexible and Powerful Parameter Server for large-scale machine learning

flink-extended/dl-on-flink

Deep Learning on Flink aims to integrate Flink and deep learning frameworks (e.g. TensorFlow,...

MingChen0919/learning-apache-spark

Notes on Apache Spark (pyspark)

mahmoudparsian/data-algorithms-book

MapReduce, Spark, Java, and Scala for Data Algorithms Book

Explore ML Frameworks

All categories Trending ML Framework directory Insights