Amazon SageMaker HyperPod now supports minimum capacity requirements (MinCount) for clusters using Slurm orchestration with continuous provisioning. With continuous provisioning, HyperPod provisions clusters with available partial capacity so you can start your AI/ML jobs quickly, while continuing to provision remaining instances asynchronously in the background. While this provides flexibility, some training workloads require a guaranteed minimum number of nodes before they can start effectively. MinCount lets you specify the minimum number of instances that must be successfully provisioned before an instance group transitions to InService status, giving you greater control over when your cluster becomes available for job scheduling.
This is particularly useful for distributed training workloads using frameworks such as PyTorch FSDP, Megatron-LM, or NVIDIA NeMo, where training jobs are commonly configured with a fixed number of participating nodes and may not start efficiently or correctly with partial cluster capacity. It also benefits teams that need to guarantee a baseline GPU count to meet SLA or cost-efficiency targets before committing to a training run.
To set a minimum capacity threshold (MinCount) for an instance group, specify the MinInstanceCount parameter in the CreateCluster or UpdateCluster API request. The instance group remains in Creating or Updating status until the threshold is met; it then transitions to InService, and its nodes become available for Slurm job scheduling. HyperPod continues launching additional instances beyond MinCount until the target count is reached. If MinCount cannot be satisfied within 3 hours, the system automatically rolls back the instance group to its last known good state.
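The threshold behavior described above can be illustrated with a minimal Python sketch. It makes no AWS calls. The helper names (`build_instance_group`, `expected_status`), the `RolledBack` label, and the validation rule that the minimum must not exceed the target count are assumptions made for illustration. The surrounding field names (`InstanceGroupName`, `InstanceType`, `InstanceCount`) are assumed to match the CreateCluster request shape and should be checked against the API reference.

```python
from typing import Any, Dict

# The announcement states a 3-hour window before automatic rollback.
ROLLBACK_TIMEOUT_HOURS = 3


def build_instance_group(name: str, instance_type: str,
                         target_count: int, min_count: int) -> Dict[str, Any]:
    """Build one instance-group entry for a CreateCluster/UpdateCluster request.

    Hypothetical helper: the 0 <= min_count <= target_count check is an
    assumption, not a documented service rule.
    """
    if not 0 <= min_count <= target_count:
        raise ValueError("min_count must be between 0 and target_count")
    return {
        "InstanceGroupName": name,      # assumed field name
        "InstanceType": instance_type,  # assumed field name
        "InstanceCount": target_count,  # assumed field name (target count)
        "MinInstanceCount": min_count,  # parameter named in the announcement
    }


def expected_status(provisioned: int, group: Dict[str, Any],
                    elapsed_hours: float) -> str:
    """Model the status transitions described in the announcement.

    "RolledBack" is an illustrative label for the rollback outcome,
    not a documented status value.
    """
    if provisioned >= group["MinInstanceCount"]:
        return "InService"
    if elapsed_hours >= ROLLBACK_TIMEOUT_HOURS:
        return "RolledBack"
    return "Creating"


if __name__ == "__main__":
    group = build_instance_group("worker-group", "ml.p5.48xlarge", 16, 12)
    for provisioned, hours in [(8, 0.5), (12, 1.0), (8, 3.0)]:
        print(provisioned, hours, expected_status(provisioned, group, hours))
```

In a real request, a dictionary like this would be one entry in the instance-group list passed to the API, alongside the other required settings such as the lifecycle configuration and execution role.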
MinCount for Slurm clusters with continuous provisioning is available in all AWS Regions where Amazon SageMaker HyperPod is supported. To get started with minimum capacity requirements for your cluster, see Minimum capacity requirements (MinCount) in the Amazon SageMaker AI documentation.
Source: Amazon Web Services