aws-samples/ecs-gpu-scaling
Amazon ECS Auto Scaling for GPU-based Machine Learning Workloads
This project helps engineers manage GPU-based machine learning workloads on Amazon ECS by automatically adjusting the number of running tasks. It takes custom GPU utilization metrics as input and scales your services up or down to efficiently handle demand. It's designed for cloud and ML engineers who deploy and manage containerized ML applications.
No commits in the last 6 months.
Use this if you are an engineer looking to implement horizontal auto-scaling for GPU-intensive machine learning applications running on Amazon ECS.
Not ideal if you are looking for a production-ready solution, as this example is for demonstrative purposes only.
Stars
17
Forks
3
Language
TypeScript
License
MIT-0
Category
Last pushed
Jan 29, 2024
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/ml-frameworks/aws-samples/ecs-gpu-scaling"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
muna-ai/muna-py
Run AI models anywhere. https://muna.ai/explore
clearml/clearml-pycharm-plugin
ClearML PyCharm Plugin
sql-machine-learning/elasticdl
Kubernetes-native Deep Learning Framework
microsoft/AKSDeploymentTutorial
Tutorial on how to deploy Deep Learning models on GPU enabled Kubernetes cluster
Langhalsdino/Kubernetes-GPU-Guide
This guide should help fellow researchers and hobbyists to easily automate and accelerate there...