tanyuqian/redco

NAACL '24 (Best Demo Paper RunnerUp) / MlSys @ NeurIPS '23 - RedCoast: A Lightweight Tool to Automate Distributed Training and Inference

44
/ 100
Emerging

This tool helps machine learning engineers and researchers efficiently train and run large AI models, like those for generating images or understanding language, across multiple GPUs or TPUs. It takes your model code and data, then automates the complex setup for distributed processing, giving you faster training and inference results without needing specialized distributed systems expertise. It's for anyone developing or deploying large-scale AI applications.

No commits in the last 6 months. Available on PyPI.

Use this if you need to train or run large AI models (e.g., LLMs, Stable Diffusion) more quickly by distributing them across multiple GPUs or TPUs, without diving deep into complex distributed systems configurations.

Not ideal if you are working with small models that don't require distributed computing or if you prefer to manually configure every aspect of your distributed ML pipeline.

large-language-models deep-learning-training model-inference gpu-acceleration machine-learning-engineering
Stale 6m
Maintenance 0 / 25
Adoption 8 / 25
Maturity 25 / 25
Community 11 / 25

How are scores calculated?

Stars

69

Forks

7

Language

Python

License

Apache-2.0

Last pushed

Dec 09, 2024

Commits (30d)

0

Dependencies

3

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/transformers/tanyuqian/redco"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.