Today, AWS announces the general availability of Neuron 2.24, delivering new features and performance improvements for customers building and deploying deep learning models on AWS Inferentia and Trainium-based instances. Neuron 2.24 introduces support for PyTorch 2.7, enhanced inference capabilities, and expanded compatibility with popular machine learning frameworks. These updates help developers and data scientists accelerate model training and inference, improve efficiency, and simplify the deployment of large language models and other AI workloads.
With Neuron 2.24, customers can take advantage of advanced inference features such as prefix caching for faster Time-To-First-Token (TTFT), disaggregated inference to reduce prefill-decode interference, and context parallelism for improved performance on long sequences. The release also brings support for Qwen 2.5 text models and improved integration with Hugging Face Optimum Neuron and PyTorch-based NxD Core backend.
Neuron 2.24 is available in all AWS Regions where Inferentia and Trainium instances are offered.
To learn more and for a full list of new features and enhancements, see:
Categories: general:products/amazon-machine-learning,marketing:marchitecture/compute,general:products/aws-tools-and-sdks
Source: Amazon Web Services
Latest Posts
- Power Platform – Block sending customer data from Dataverse audit events to Purview Activity logging [MC1282554]
![Power Platform - Block sending customer data from Dataverse audit events to Purview Activity logging [MC1282554] 2 pexels joshsorenson 1714208](data:image/svg+xml;base64,PHN2ZyB3aWR0aD0iMSIgaGVpZ2h0PSIxIiB4bWxucz0iaHR0cDovL3d3dy53My5vcmcvMjAwMC9zdmciPjwvc3ZnPg==)
- Copilot entry point changes in Excel [MC1282571]
![Copilot entry point changes in Excel [MC1282571] 3 pexels minan1398 1527255](data:image/svg+xml;base64,PHN2ZyB3aWR0aD0iMSIgaGVpZ2h0PSIxIiB4bWxucz0iaHR0cDovL3d3dy53My5vcmcvMjAwMC9zdmciPjwvc3ZnPg==)
- Power Apps – Enable online mode to access Dataverse for Canvas apps [MC1282566]
![Power Apps - Enable online mode to access Dataverse for Canvas apps [MC1282566] 4 pexels padrinan 1111367](data:image/svg+xml;base64,PHN2ZyB3aWR0aD0iMSIgaGVpZ2h0PSIxIiB4bWxucz0iaHR0cDovL3d3dy53My5vcmcvMjAwMC9zdmciPjwvc3ZnPg==)
- Exchange Online, SharePoint Online, and Microsoft Teams: April 2026 industry-wide DigiCert Global Root CA (G1) distrust [MC1282565]
![Exchange Online, SharePoint Online, and Microsoft Teams: April 2026 industry-wide DigiCert Global Root CA (G1) distrust [MC1282565] 5 pexels pixabay 276517](data:image/svg+xml;base64,PHN2ZyB3aWR0aD0iMSIgaGVpZ2h0PSIxIiB4bWxucz0iaHR0cDovL3d3dy53My5vcmcvMjAwMC9zdmciPjwvc3ZnPg==)

![Power Platform - Block sending customer data from Dataverse audit events to Purview Activity logging [MC1282554] 2 pexels joshsorenson 1714208](https://mwpro.co.uk/wp-content/uploads/2024/08/pexels-joshsorenson-1714208-150x150.webp)
![Copilot entry point changes in Excel [MC1282571] 3 pexels minan1398 1527255](https://mwpro.co.uk/wp-content/uploads/2024/08/pexels-minan1398-1527255-150x150.webp)
![Power Apps - Enable online mode to access Dataverse for Canvas apps [MC1282566] 4 pexels padrinan 1111367](https://mwpro.co.uk/wp-content/uploads/2024/08/pexels-padrinan-1111367-150x150.webp)
![Exchange Online, SharePoint Online, and Microsoft Teams: April 2026 industry-wide DigiCert Global Root CA (G1) distrust [MC1282565] 5 pexels pixabay 276517](https://mwpro.co.uk/wp-content/uploads/2025/06/pexels-pixabay-276517-150x150.webp)
