linkedin/Avro2TF
Avro2TF is designed to fill the gap of making users' training data ready to be consumed by deep learning training frameworks.
This tool helps machine learning engineers and data scientists prepare their raw dataset for deep learning models. It converts existing machine learning data, often in Avro format with sparse vectors, into the specific tensor format required by TensorFlow. This allows practitioners to efficiently feed their large-scale datasets into deep learning frameworks without manual data transformation.
128 stars. No commits in the last 6 months.
Use this if you are a machine learning engineer working with large datasets in Avro or other Spark-supported formats and need to convert them into TensorFlow's TFRecord tensor format for deep learning model training.
Not ideal if your data is already in TensorFlow-compatible formats or if you are not working with deep learning models that use TensorFlow.
Stars
128
Forks
21
Language
Scala
License
BSD-2-Clause
Category
Last pushed
May 09, 2020
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/ml-frameworks/linkedin/Avro2TF"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
tensorflow/tfx
TFX is an end-to-end platform for deploying production ML pipelines
VowpalWabbit/vowpal_wabbit
Vowpal Wabbit is a machine learning system which pushes the frontier of machine learning with...
yahoo/TensorFlowOnSpark
TensorFlowOnSpark brings TensorFlow programs to Apache Spark clusters.
projectglow/glow
An open-source toolkit for large-scale genomic analysis
Wei-1/Scala-Machine-Learning
No Dependency Scala Machine Learning Algorithm Gallery