Amazon SageMaker HyperPod now supports deploying both open-weights foundation models from Amazon SageMaker JumpStart and your own fine-tuned models from Amazon S3 and Amazon FSx directly to Amazon SageMaker HyperPod. This enables you to seamlessly train, fine tune, and deploy models on the same HyperPod compute resources, maximizing resource utilization across the entire model lifecycle
In a few quick steps, you can choose an open-weights foundation model from SageMaker JumpStart and quickly deploy it on your SageMaker HyperPod cluster. SageMaker automatically provisions the infrastructure, deploys the model on your cluster, enables auto-scaling, and configures the SageMaker endpoint. SageMaker scales the compute resources up and down through HyperPod task governance as the traffic on model endpoints changes, and automatically publishes metrics to the HyperPod observability dashboard to provide full visibility into model performance.
You can deploy models from SageMaker JumpStart in all AWS Regions where HyperPod is available: US East (N. Virginia), US West (N. California), US West (Oregon), Asia Pacific (Mumbai), Asia Pacific (Singapore), Asia Pacific (Sydney), and Asia Pacific (Tokyo), Europe (Frankfurt), Europe (Ireland), Europe (London), Europe (Stockholm), and South America (São Paulo).
To learn more, visit SageMaker HyperPod webpage, blog, and documentation.
Categories:
Source: Amazon Web Services
Latest Posts
- SharePoint agents are available in Teams app Store [MC1143301]
- Microsoft 365 Copilot | Scoped knowledge support for declarative agents in VIP connectors [MC1143299]
- AWS Management Console now supports assigning a color to an AWS account for easier identification
- AWS Client VPN extends OS support to Windows Arm64 v5.3.0