Today, AWS announces the general availability of Neuron SDK 2.25.0, delivering improvements for inference workloads and performance monitoring on AWS Inferentia and Trainium instances. This latest release adds context and data parallelism support as well as chunked attention for long sequence processing in inference, and updates the neuron-ls and neuron-monitor APIs with more information on node affinities and device utilization, respectively.
This release also introduces automatic aliasing (Beta) for fast tensor operations, and adds improvements for disaggregated serving (Beta). Finally, it provides upgraded AMIs and Deep Learning Containers for inference and training workloads on Neuron.
Neuron 2.25.0 is available in all AWS Regions where Inferentia and Trainium instances are offered.
To learn more and for a full list of new features and enhancements, see:
Source: Amazon Web Services