ShifuML/guagua
An iterative computing framework for both Hadoop MapReduce and Hadoop YARN.
This framework helps data scientists and machine learning engineers train complex machine learning models faster. It takes large datasets and a model definition as input, and outputs a trained model. It's designed for professionals working with big data and distributed computing platforms like Hadoop.
No commits in the last 6 months.
Use this if you need to significantly speed up the training of large-scale machine learning models, especially neural networks, on Hadoop.
Not ideal if you are working with small datasets or prefer a single-machine training environment.
Stars
75
Forks
41
Language
Java
License
Apache-2.0
Category
Last pushed
May 20, 2022
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/ml-frameworks/ShifuML/guagua"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
lensacom/sparkit-learn
PySpark + Scikit-learn = Sparkit-learn
Angel-ML/angel
A Flexible and Powerful Parameter Server for large-scale machine learning
flink-extended/dl-on-flink
Deep Learning on Flink aims to integrate Flink and deep learning frameworks (e.g. TensorFlow,...
MingChen0919/learning-apache-spark
Notes on Apache Spark (pyspark)
mahmoudparsian/data-algorithms-book
MapReduce, Spark, Java, and Scala for Data Algorithms Book