Amazon SageMaker HyperPod now supports managed node autoscaling using Karpenter, enabling customers to automatically scale their clusters to meet dynamic inference and training demands. Real-time inference workloads require automatic scaling to address unpredictable traffic patterns and maintain service level agreements, while optimizing costs. However, organizations often struggle with the operational overhead of installing, configuring, and maintaining complex autoscaling solutions. HyperPod-managed node autoscaling eliminates the undifferentiated heavy lifting of Karpenter setup and maintenance, while providing integrated resilience and fault tolerance capabilities.
Autoscaling on HyperPod with Karpenter enables customers to achieve just-in-time provisioning that rapidly adapts GPU compute for inference traffic spikes. Customers can scale to zero nodes during low-demand periods without maintaining dedicated controller infrastructure and benefit from workload-aware node selection that optimizes instance types and costs. For inference workloads, this provides automatic capacity scaling to handle production traffic bursts, cost reduction through intelligent node consolidation during idle periods, and seamless integration with event-driven pod autoscalers like KEDA. Training workloads also benefit from automatic resource optimization during model development cycles. You can enable autoscaling on HyperPod using the UpdateCluster API with AutoScaling mode set to “Enable” and AutoScalerType set to “Karpenter”.
This feature is available in all AWS Regions where Amazon SageMaker HyperPod EKS clusters are supported. To learn more about autoscaling on SageMaker HyperPod with Karpenter, see the user guide and blog.
Categories: marketing:marchitecture/artificial-intelligence,marketing:marchitecture/cost-management
Source: Amazon Web Services

![Updates available for Microsoft 365 Apps for all channels [MC1217711] 2 pexels leofallflat 1089194](https://mwpro.co.uk/wp-content/uploads/2024/08/pexels-leofallflat-1089194-150x150.webp)
![Microsoft Viva: Viva Glint – New permissions [MC1217642] 3 pexels pixabay 163007](https://mwpro.co.uk/wp-content/uploads/2025/06/pexels-pixabay-163007-150x150.webp)
![Microsoft Teams: Choose your "Enter" key behavior in Teams Chat Settings [MC1217643] 4 pexels davefilm 2643596](https://mwpro.co.uk/wp-content/uploads/2025/06/pexels-davefilm-2643596-150x150.webp)
![Microsoft Edge: Cross-platform policies in the Edge management service [MC1217641] 5 pexels karolina grabowska 4039487](https://mwpro.co.uk/wp-content/uploads/2024/08/pexels-karolina-grabowska-4039487-150x150.webp)
