Amazon Bedrock now supports Responses API on new OpenAI API-compatible service endpoints. Responses API enables developers to achieve asynchronous inference for long-running inference workloads, simplifies tool use integration for agentic workflows, and also supports stateful conversation management. Instead of requiring developers to pass the entire conversation history with each request, Responses API enables them to automatically rebuild context without manual history management. These new service endpoints support both streaming and non-streaming modes, enable reasoning effort support within Chat Completions API, and require only a base URL change for developers to integrate within existing codebases with OpenAI SDK compatibility.
Chat Completions with reasoning effort support is available for all Amazon Bedrock models that are powered by Mantle, a new distributed inference engine for large-scale machine learning model serving on Amazon Bedrock. Mantle simplifies and expedites onboarding of new models onto Amazon Bedrock, provides highly performant and reliable serverless inference with sophisticated quality of service controls, unlocks higher default customer quotas with automated capacity management and unified pools, and provides out-of-the-box compatibility with OpenAI API specifications. Responses API support is available today starting with OpenAI’s GPT OSS 20B/120B models, with support for other models coming soon.
To get started, visit the service documentation here
Categories: general:products/amazon-bedrock,marketing:marchitecture/artificial-intelligence
Source: Amazon Web Services
Latest Posts
- Amazon SageMaker HyperPod now supports data capture for inference workloads

- Microsoft Teams: Front-of-room view control for Webinars and structured meetings in Teams Rooms on Android [MC1316231]
![Microsoft Teams: Front-of-room view control for Webinars and structured meetings in Teams Rooms on Android [MC1316231] 3 pexels pixabay 276517](data:image/svg+xml;base64,PHN2ZyB3aWR0aD0iMSIgaGVpZ2h0PSIxIiB4bWxucz0iaHR0cDovL3d3dy53My5vcmcvMjAwMC9zdmciPjwvc3ZnPg==)
- New flexibility and choice for sharing organizational data across Microsoft 365 and Viva apps [MC1316232]
![New flexibility and choice for sharing organizational data across Microsoft 365 and Viva apps [MC1316232] 4 laptop 3087585 1280](data:image/svg+xml;base64,PHN2ZyB3aWR0aD0iMSIgaGVpZ2h0PSIxIiB4bWxucz0iaHR0cDovL3d3dy53My5vcmcvMjAwMC9zdmciPjwvc3ZnPg==)
- Amazon Managed Grafana now supports dual-stack connectivity (IPv6 and IPv4)



![Microsoft Teams: Front-of-room view control for Webinars and structured meetings in Teams Rooms on Android [MC1316231] 3 pexels pixabay 276517](https://mwpro.co.uk/wp-content/uploads/2025/06/pexels-pixabay-276517-150x150.webp)
![New flexibility and choice for sharing organizational data across Microsoft 365 and Viva apps [MC1316232] 4 laptop 3087585 1280](https://mwpro.co.uk/wp-content/uploads/2025/06/laptop-3087585_1280-150x150.webp)

![(Updated) Microsoft Teams admin center: App centric management for app installation and changes to app setup policies [MC795355] 7 (Updated) Microsoft Teams admin center: App centric management for app installation and changes to app setup policies [MC795355]](https://mwpro.co.uk/wp-content/uploads/2025/06/pexels-magda-ehlers-pexels-1329317-96x96.webp)