SageMaker HyperPod now supports fine-grained quota allocation of compute resources

August 14, 2025

SageMaker HyperPod task governance now supports fine-grained compute quota allocation of GPU, Trainium accelerator, vCPU, and vCPU memory within an instance. Administrators can allocate fine-grained compute quota across teams, optimizing compute resource distribution and staying within budget.

Data scientists often run LLM tasks, such as training or inference, that do not require an entire HyperPod instance, so allocating whole instances leaves accelerated compute underutilized. HyperPod task governance enables administrators to manage compute quota allocation across teams. With fine-grained allocation available in addition to instance-level allocation, administrators can now distribute compute resources in line with organizational workload demands, ensuring fair access, preventing resource monopolization, and maximizing cluster utilization.
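As a rough illustration of what sub-instance quota accounting involves, the sketch below splits one instance's accelerators, vCPUs, and memory between two teams and reports what is left over. It is plain Python and does not call any HyperPod or AWS API; the `Quota` type, the `remaining` helper, the team names, and the instance shape are all hypothetical and chosen only for the example.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Quota:
    """A slice of one instance (hypothetical type, not an AWS API object)."""
    accelerators: int = 0
    vcpus: float = 0.0
    memory_gib: float = 0.0


def remaining(capacity: Quota, allocations: dict[str, Quota]) -> Quota:
    """Return the capacity left after all team allocations are subtracted.

    Raises ValueError if the teams together exceed any resource dimension.
    """
    left = Quota(
        accelerators=capacity.accelerators
        - sum(q.accelerators for q in allocations.values()),
        vcpus=capacity.vcpus - sum(q.vcpus for q in allocations.values()),
        memory_gib=capacity.memory_gib
        - sum(q.memory_gib for q in allocations.values()),
    )
    if min(left.accelerators, left.vcpus, left.memory_gib) < 0:
        raise ValueError("team allocations exceed instance capacity")
    return left


# Assumed instance shape, for illustration only.
INSTANCE = Quota(accelerators=8, vcpus=192.0, memory_gib=2048.0)

# Hypothetical teams, each receiving part of the instance rather than all of it.
TEAMS = {
    "research": Quota(accelerators=4, vcpus=96.0, memory_gib=1024.0),
    "inference": Quota(accelerators=2, vcpus=32.0, memory_gib=512.0),
}

if __name__ == "__main__":
    print(remaining(INSTANCE, TEAMS))
```

With instance-level allocation alone, the second team in this sketch would need a whole instance of its own; with per-resource quotas, both teams fit on one instance and some capacity remains unassigned.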

SageMaker HyperPod task governance is available in all AWS Regions where HyperPod is available: US East (N. Virginia), US West (N. California), US West (Oregon), Asia Pacific (Mumbai), Asia Pacific (Singapore), Asia Pacific (Sydney), Asia Pacific (Tokyo), Europe (Frankfurt), Europe (Ireland), Europe (London), Europe (Stockholm), and South America (São Paulo).

To learn more, visit the SageMaker HyperPod webpage and the HyperPod task governance documentation.
