Today, AWS announces new Amazon Elastic Compute Cloud (Amazon EC2) P5 instance size with one NVIDIA H100 GPU that allows businesses to right-size their machine learning (ML) and high-performance computing (HPC) resources with cost-effectiveness.
The new instance size enables customers to start small and scale in granular increments, providing more flexible control over infrastructure costs. Customers developing small to medium Large Language Models (LLMs) such as chatbots or specialized language translation tools can now run inference tasks more economically. Customers can also use these instances to deploy HPC applications for pharmaceutical discovery, fluid flow analysis, and financial modeling without committing to expensive, large-scale GPU deployments.
P5.4xlarge instances are now available through Amazon EC2 Capacity Blocks for ML in the following AWS Regions: US East (North Virginia, Ohio), US West (Oregon), Europe (London), Asia Pacific (Mumbai, Sydney, Tokyo) and South America (Sao Paulo) regions. These instances can be purchased On-Demand, Spot or through Savings Plans in Europe (London), Asia Pacific (Mumbai, Jakarta, Tokyo), and South America (Sao Paulo) regions.
To learn more about P5.4xlarge instances, visit Amazon EC2 P5 instances.
Categories:
Source: Amazon Web Services
Latest Posts
- (Updated) Microsoft 365 Copilot Chat: New ways to include files and emails in prompts [MC1139489]
- (Updated) Azure Information Protection: Enable multifactor authentication for your Azure tenant by October 1, 2025 [MC1143999]
- (Updated) Microsoft Outlook for iOS/Android: Improved Contacts with profile enrichment and duplicate management [MC1093912]
- (Updated) Microsoft Teams: Android Open Source Project (AOSP) – Device Management auto updates [MC1066157]