aws/sagemaker-spark

A Spark library for Amazon SageMaker.

/ 100

Established

This tool helps data scientists and machine learning engineers easily integrate their large-scale data processing workflows with Amazon SageMaker's machine learning capabilities. You can feed your Spark DataFrames into SageMaker for training using either Amazon's built-in algorithms or your own custom models, and then get predictions back on Spark DataFrames from the deployed SageMaker models. It's designed for those managing big data machine learning pipelines.

301 stars. No commits in the last 6 months.

Use this if you need to train machine learning models at scale using Amazon SageMaker directly from your Apache Spark applications, or if you want to deploy and get predictions from SageMaker models within your Spark pipelines.

Not ideal if you are not using Apache Spark for data processing, or if you prefer to manage your machine learning workflows entirely outside of the Amazon SageMaker ecosystem.

big-data-processing machine-learning-engineering cloud-ml-training data-science-pipelines predictive-modeling

Stale 6m No Package No Dependents

Maintenance 0 / 25

Adoption 10 / 25

Maturity 16 / 25

Community 25 / 25

How are scores calculated?

Stars

301

Forks

128

Language

Scala

License

Apache-2.0

Compare

sagemaker-spark and sagemaker-python-sdk

Related frameworks

aws/sagemaker-python-sdk

A library for training and deploying machine learning models on Amazon SageMaker

aws/amazon-sagemaker-examples

Example 📓 Jupyter notebooks that demonstrate how to build, train, and deploy machine learning...

aws/sagemaker-xgboost-container

This is the Docker container based on open source framework XGBoost...

aws-samples/aws-ml-enablement-workshop

組織横断的にチームを組成し、機械学習による成長サイクルを実現する計画を立てるワークショップ

aws/sagemaker-training-toolkit

Train machine learning models within a 🐳 Docker container using 🧠 Amazon SageMaker.

Explore ML Frameworks

All categories Trending ML Framework directory Insights