aws-samples/ecs-gpu-scaling

Amazon ECS Auto Scaling for GPU-based Machine Learning Workloads

35
/ 100
Emerging

This project helps engineers manage GPU-based machine learning workloads on Amazon ECS by automatically adjusting the number of running tasks. It takes custom GPU utilization metrics as input and scales your services up or down to efficiently handle demand. It's designed for cloud and ML engineers who deploy and manage containerized ML applications.

No commits in the last 6 months.

Use this if you are an engineer looking to implement horizontal auto-scaling for GPU-intensive machine learning applications running on Amazon ECS.

Not ideal if you are looking for a production-ready solution, as this example is for demonstrative purposes only.

cloud-engineering machine-learning-operations container-orchestration resource-management auto-scaling
Stale 6m No Package No Dependents
Maintenance 0 / 25
Adoption 6 / 25
Maturity 16 / 25
Community 13 / 25

How are scores calculated?

Stars

17

Forks

3

Language

TypeScript

License

MIT-0

Last pushed

Jan 29, 2024

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/ml-frameworks/aws-samples/ecs-gpu-scaling"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.