yahoo/TensorFlowOnSpark

TensorFlowOnSpark brings TensorFlow programs to Apache Spark clusters.

60
/ 100
Established

This project helps data scientists and machine learning engineers run their TensorFlow deep learning models on large Apache Spark and Hadoop clusters. You can take your existing TensorFlow code and run it in a distributed fashion, processing vast datasets managed by Spark. It's designed for professionals building and deploying machine learning solutions at scale.

3,859 stars. No commits in the last 6 months. Available on PyPI.

Use this if you need to scale your TensorFlow deep learning training or inference tasks across a distributed cluster using Apache Spark and Hadoop, leveraging existing TensorFlow models.

Not ideal if you are developing small-scale models on a single machine or prefer deep learning frameworks other than TensorFlow.

distributed-machine-learning deep-learning-ops big-data-analytics model-training scalable-inference
Stale 6m No Dependents
Maintenance 0 / 25
Adoption 10 / 25
Maturity 25 / 25
Community 25 / 25

How are scores calculated?

Stars

3,859

Forks

941

Language

Python

License

Apache-2.0

Last pushed

Jul 10, 2023

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/ml-frameworks/yahoo/TensorFlowOnSpark"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.