Today, AWS announces new Amazon Elastic Compute Cloud (Amazon EC2) P5 instance size with one NVIDIA H100 GPU that allows businesses to right-size their machine learning (ML) and high-performance computing (HPC) resources with cost-effectiveness.
The new instance size enables customers to start small and scale in granular increments, providing more flexible control over infrastructure costs. Customers developing small to medium Large Language Models (LLMs) such as chatbots or specialized language translation tools can now run inference tasks more economically. Customers can also use these instances to deploy HPC applications for pharmaceutical discovery, fluid flow analysis, and financial modeling without committing to expensive, large-scale GPU deployments.
P5.4xlarge instances are now available through Amazon EC2 Capacity Blocks for ML in the following AWS Regions: US East (North Virginia, Ohio), US West (Oregon), Europe (London), Asia Pacific (Mumbai, Sydney, Tokyo) and South America (Sao Paulo) regions. These instances can be purchased On-Demand, Spot or through Savings Plans in Europe (London), Asia Pacific (Mumbai, Jakarta, Tokyo), and South America (Sao Paulo) regions.
To learn more about P5.4xlarge instances, visit Amazon EC2 P5 instances.
Categories:
Source: Amazon Web Services
Latest Posts
- Power Automate – Reference previous prompts in Copilot for Power Automate desktop [MC1175628]
- Power Apps – Create offline profiles in the maker studio for Canvas apps [MC1171647]
- Dynamics 365 Contact Center – Historical analytics – Enable Intent group and agent group-based metrics and dimensions [MC1175088]
- Introducing Image Search in Microsoft Teams [MC1174858]