clearml/clearml-fractional-gpu
ClearML Fractional GPU - Run multiple containers on the same GPU with driver-level memory limitation ✨ and compute time-slicing
This project helps AI developers and researchers efficiently share powerful GPUs among multiple users or AI workloads. It takes your existing AI models or training jobs, packaged as Docker containers, and lets them run concurrently on the same GPU without any one job monopolizing the resources. The result is more cost-effective, better-utilized GPU infrastructure for AI development.
Use this if you need to run multiple AI workloads or experiments simultaneously on a single GPU, ensuring each container-based job gets a fair share of GPU memory and compute time.
Not ideal if your AI workloads cannot be containerized, or if you rely exclusively on static partitioning such as NVIDIA MIG and need no dynamic adjustment.
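The sharing model described above can be sketched with a pair of docker invocations. This is a hedged illustration: the image tag (`u22-cu12.3-8gb`) and the training scripts are assumptions, not taken from this page, so check the project's README for the actual published tags.

```
# Two jobs sharing GPU 0; each container image carries a driver-level shim
# that hard-limits GPU memory (here assumed to be an 8 GB variant).
# Image tag and scripts are illustrative assumptions.
docker run -d --gpus '"device=0"' --ipc=host --pid=host \
    clearml/fractional-gpu:u22-cu12.3-8gb python train_job_a.py

docker run -d --gpus '"device=0"' --ipc=host --pid=host \
    clearml/fractional-gpu:u22-cu12.3-8gb python train_job_b.py
```

Both containers see the same physical GPU, but each is capped at the memory limit baked into its image, so neither job can starve the other.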
Stars: 90
Forks: 6
Language: —
License: —
Category: —
Last pushed: Mar 12, 2026
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/generative-ai/clearml/clearml-fractional-gpu"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
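For programmatic access, the endpoint above can be wrapped in a few lines of Python. The URL pattern is taken directly from the curl example; the shape of the returned JSON is not documented on this page, so the response is returned as-is for the caller to inspect.

```python
import json
from urllib.request import urlopen

# Base URL copied from the curl example on this page.
API_BASE = "https://pt-edge.onrender.com/api/v1/quality"

def quality_url(category: str, owner: str, repo: str) -> str:
    """Build the quality-data URL for a repository."""
    return f"{API_BASE}/{category}/{owner}/{repo}"

def fetch_quality(category: str, owner: str, repo: str) -> dict:
    """Fetch quality data; JSON field names are not documented here."""
    with urlopen(quality_url(category, owner, repo)) as resp:
        return json.load(resp)
```

Usage mirrors the curl example: `fetch_quality("generative-ai", "clearml", "clearml-fractional-gpu")`.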
Higher-rated alternatives
openvinotoolkit/model_server
A scalable inference server for models optimized with OpenVINO™
madroidmaq/mlx-omni-server
MLX Omni Server is a local inference server powered by Apple's MLX framework, specifically...
NVIDIA-NeMo/Guardrails
NeMo Guardrails is an open-source toolkit for easily adding programmable guardrails to LLM-based...
generative-computing/mellea
Mellea is a library for writing generative programs.
rhesis-ai/rhesis
Open-source platform & SDK for testing LLM and agentic apps. Define expected behavior, generate...