Today, AWS has introduced five new Elastic Fabric Adapter (EFA) metrics to enhance network observability for AI/ML and High Performance Computing (HPC) workloads. These new metrics help diagnose performance issues by tracking retransmitted packets and bytes, retransmit timeout events, impaired remote connection events, and unresponsive remote receiver events.
With these new metrics, you can monitor for network congestion or instance configuration issues, allowing for timely action to maintain application performance. The metrics are implemented as counters at the per-EFA device level, accumulating data since instance launch or the most recent driver reset. Stored in the sys filesystem, these metrics counters are accessible via the instance command line. For enhanced monitoring and alerting capabilities, you can integrate these metrics into Prometheus scripts, facilitating export to third-party tools such as Grafana for dashboard creation and alarm setting. The new metrics are available on Nitro v4 (and later) instances and require EFA installer version 1.43.0 or higher. For a full list of metrics and to learn more on how to use them, please visit the Monitor an EFA user guide. For a comprehensive list of instances built on different Nitro system versions, please refer to the AWS Nitro Systems documentation.
These new metrics are supported in all commercial AWS Regions, the AWS GovCloud (US) Regions, and the China Regions. To learn more about EFA, please visit the EFA documentation.
Categories: general:products/aws-govcloud-us,marketing:marchitecture/artificial-intelligence,marketing:marchitecture/networking-and-content-delivery
Source: Amazon Web Services
Latest Posts
- GCP Release Notes: March 06, 2026

- (Updated) Microsoft 365: Modern Access Request and Access Denied web page [MC1188599]
![(Updated) Microsoft 365: Modern Access Request and Access Denied web page [MC1188599] 3 pexels leish 5258251](data:image/svg+xml;base64,PHN2ZyB3aWR0aD0iMSIgaGVpZ2h0PSIxIiB4bWxucz0iaHR0cDovL3d3dy53My5vcmcvMjAwMC9zdmciPjwvc3ZnPg==)
- (Updated) Introducing Surveys Agent and Copilot Chat in Microsoft Forms [MC1229954]
![(Updated) Introducing Surveys Agent and Copilot Chat in Microsoft Forms [MC1229954] 4 pexels googledeepmind 18068537](data:image/svg+xml;base64,PHN2ZyB3aWR0aD0iMSIgaGVpZ2h0PSIxIiB4bWxucz0iaHR0cDovL3d3dy53My5vcmcvMjAwMC9zdmciPjwvc3ZnPg==)
- (Updated) Upcoming change: disabling Teams meeting recording expiration notification emails [MC1245635]
![(Updated) Upcoming change: disabling Teams meeting recording expiration notification emails [MC1245635] 5 pexels alfonso escalante 1319242 2533092](data:image/svg+xml;base64,PHN2ZyB3aWR0aD0iMSIgaGVpZ2h0PSIxIiB4bWxucz0iaHR0cDovL3d3dy53My5vcmcvMjAwMC9zdmciPjwvc3ZnPg==)


![(Updated) Microsoft 365: Modern Access Request and Access Denied web page [MC1188599] 3 pexels leish 5258251](https://mwpro.co.uk/wp-content/uploads/2025/06/pexels-leish-5258251-150x150.webp)
![(Updated) Introducing Surveys Agent and Copilot Chat in Microsoft Forms [MC1229954] 4 pexels googledeepmind 18068537](https://mwpro.co.uk/wp-content/uploads/2025/06/pexels-googledeepmind-18068537-150x150.webp)
![(Updated) Upcoming change: disabling Teams meeting recording expiration notification emails [MC1245635] 5 pexels alfonso escalante 1319242 2533092](https://mwpro.co.uk/wp-content/uploads/2025/06/pexels-alfonso-escalante-1319242-2533092-150x150.webp)
![Update Principal owner of a SharePoint Embedded user-owned container through SPAC and PowerShell [MC1152315] 7 Update Principal owner of a SharePoint Embedded user-owned container through SPAC and PowerShell [MC1152315]](https://mwpro.co.uk/wp-content/uploads/2024/08/pexels-550498053-16792653-150x150.webp)