Amazon Bedrock now supports OpenAI’s open-weight GPT OSS models (120B and 20B) and NVIDIA Nemotron (Nano 9B v2, Nano 12B v2, Nano 30B, Super 120B) models expanding your ability to build and scale generative AI applications with diverse, high-performance foundation models. This offers the flexibility to leverage OpenAI’s and NVIDIA’s latest models alongside other leading AI models through a single, unified API—allowing you to select the best model for each specific use case without changing your application code.
OpenAI GPT OSS models deliver powerful language understanding and generation capabilities with open-weight architectures, enabling enterprises to build sophisticated AI applications with transparency and flexibility. NVIDIA Nemotron models offer both small language model (SLM) and large language model (LLM) capabilities delivering high compute efficiency and accuracy that developers can use to build specialized agentic AI systems. The models are fully open with open weights, datasets, and recipes facilitating transparency and confidence for developers and enterprises.
These models are powered by Mantle, a new distributed inference engine for large-scale machine learning model serving on Amazon Bedrock. Mantle simplifies and expedites onboarding of new models onto Amazon Bedrock, provides highly performant and reliable serverless inference with sophisticated quality of service controls, unlocks higher default customer quotas with automated capacity management and unified pools, and provides out-of-the-box compatibility with OpenAI API specifications. With OpenAI GPT OSS and NVIDIA Nemotron models available in Amazon Bedrock on AWS GovCloud (US), you can accelerate innovation while benefiting from AWS’s enterprise-grade security, seamless scaling, and cost-optimization features compliantly.
Categories: general:products/aws-govcloud-us
Source: Amazon Web Services
Latest Posts
- Microsoft Teams: Front-of-room view control for Webinars and structured meetings in Teams Rooms on Android [MC1316231]
![Microsoft Teams: Front-of-room view control for Webinars and structured meetings in Teams Rooms on Android [MC1316231] 2 pexels pixabay 276517](data:image/svg+xml;base64,PHN2ZyB3aWR0aD0iMSIgaGVpZ2h0PSIxIiB4bWxucz0iaHR0cDovL3d3dy53My5vcmcvMjAwMC9zdmciPjwvc3ZnPg==)
- New flexibility and choice for sharing organizational data across Microsoft 365 and Viva apps [MC1316232]
![New flexibility and choice for sharing organizational data across Microsoft 365 and Viva apps [MC1316232] 3 laptop 3087585 1280](data:image/svg+xml;base64,PHN2ZyB3aWR0aD0iMSIgaGVpZ2h0PSIxIiB4bWxucz0iaHR0cDovL3d3dy53My5vcmcvMjAwMC9zdmciPjwvc3ZnPg==)
- Amazon Managed Grafana now supports dual-stack connectivity (IPv6 and IPv4)

- Amazon ECS introduces pause and continue controls for service deployments


![Microsoft Teams: Front-of-room view control for Webinars and structured meetings in Teams Rooms on Android [MC1316231] 2 pexels pixabay 276517](https://mwpro.co.uk/wp-content/uploads/2025/06/pexels-pixabay-276517-150x150.webp)
![New flexibility and choice for sharing organizational data across Microsoft 365 and Viva apps [MC1316232] 3 laptop 3087585 1280](https://mwpro.co.uk/wp-content/uploads/2025/06/laptop-3087585_1280-150x150.webp)


