NVIDIA-Merlin/dataloader
The merlin dataloader lets you rapidly load tabular data for training deep leaning models with TensorFlow, PyTorch or JAX
This tool helps machine learning engineers efficiently train recommendation systems. It takes large tabular datasets, often stored in formats like Parquet, and feeds them directly to deep learning models in TensorFlow, PyTorch, or JAX. The result is significantly faster model training, especially for datasets too large to fit into system memory.
423 stars. No commits in the last 6 months.
Use this if you are a machine learning engineer working on recommendation systems and experiencing bottlenecks when loading large tabular datasets for model training.
Not ideal if your primary task doesn't involve training deep learning recommendation models or if your datasets are small enough to be handled efficiently by standard framework dataloaders.
Stars
423
Forks
27
Language
Python
License
Apache-2.0
Category
Last pushed
Apr 16, 2024
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/ml-frameworks/NVIDIA-Merlin/dataloader"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
CliMA/Oceananigans.jl
🌊 Julia software for fast, friendly, flexible, ocean-flavored fluid dynamics on CPUs and GPUs
JuliaLang/julia
The Julia Programming Language
WassimTenachi/PhySO
Physical Symbolic Optimization
EnzymeAD/Enzyme.jl
Julia bindings for the Enzyme automatic differentiator
astroautomata/SymbolicRegression.jl
Distributed High-Performance Symbolic Regression in Julia