Anthropic’s Claude Sonnet 4 and OpenAI’s GPT-OSS 120B and 20B models are now available for Batch inference in Amazon Bedrock. With Batch inference, you can run multiple inference requests asynchronously, improving performance on large datasets at 50% of the on-demand inference pricing. Amazon Bedrock offers select foundation models (FMs) from leading AI providers such as Anthropic, OpenAI, Meta, and Amazon for batch inference, making it easier and more cost-effective to process high-volume workloads.
With Batch inference on Claude Sonnet 4 and OpenAI GPT-OSS models, you can process large datasets for scenarios such as document and customer feedback analysis, bulk content generation (e.g., marketing copy, product descriptions), large-scale prompt or output evaluations, automated summarization of knowledge bases and archives, mass categorization of support tickets or emails, and extraction of structured data from unstructured text—at scale and with lower costs. We’ve optimized our Batch offering to deliver higher overall batch throughput on these newer models compared to previous ones. In addition, you can now track your Batch workload progress at the AWS account level with Amazon CloudWatch metrics. For all models, these metrics include total pending records, processed records and tokens per minute, and for Claude models, they also include tokens pending processing.
To learn more about Batch inference in Amazon Bedrock, visit the Batch inference documentation. You can visit Supported Regions and models for batch inference page for more details on supported models and follow Amazon Bedrock API reference to get started with Batch inference.
Categories: general:products/amazon-bedrock,marketing:marchitecture/artificial-intelligence
Source: Amazon Web Services
Latest Posts
- (Updated) Microsoft 365 Copilot: Email triage with pin, flag, archive, and mark read [MC1193695]
![(Updated) Microsoft 365 Copilot: Email triage with pin, flag, archive, and mark read [MC1193695] 2 pexels skitterphoto 390089](data:image/svg+xml;base64,PHN2ZyB3aWR0aD0iMSIgaGVpZ2h0PSIxIiB4bWxucz0iaHR0cDovL3d3dy53My5vcmcvMjAwMC9zdmciPjwvc3ZnPg==)
- (Updated) DNS Provisioning Change [MC1048624]
![(Updated) DNS Provisioning Change [MC1048624] 3 pexels any lane 5945734](data:image/svg+xml;base64,PHN2ZyB3aWR0aD0iMSIgaGVpZ2h0PSIxIiB4bWxucz0iaHR0cDovL3d3dy53My5vcmcvMjAwMC9zdmciPjwvc3ZnPg==)
- (Updated) Microsoft PowerPoint: “Reuse Slides” feature will retire starting 2026 [MC1179161]
![(Updated) Microsoft PowerPoint: "Reuse Slides" feature will retire starting 2026 [MC1179161] 4 road 6486701 1920](data:image/svg+xml;base64,PHN2ZyB3aWR0aD0iMSIgaGVpZ2h0PSIxIiB4bWxucz0iaHR0cDovL3d3dy53My5vcmcvMjAwMC9zdmciPjwvc3ZnPg==)
- (Updated) Microsoft 365 Copilot: Create and view Outlook rules [MC1223821]
![(Updated) Microsoft 365 Copilot: Create and view Outlook rules [MC1223821] 5 pexels asadphoto 1430675](data:image/svg+xml;base64,PHN2ZyB3aWR0aD0iMSIgaGVpZ2h0PSIxIiB4bWxucz0iaHR0cDovL3d3dy53My5vcmcvMjAwMC9zdmciPjwvc3ZnPg==)

![(Updated) Microsoft 365 Copilot: Email triage with pin, flag, archive, and mark read [MC1193695] 2 pexels skitterphoto 390089](https://mwpro.co.uk/wp-content/uploads/2024/08/pexels-skitterphoto-390089-150x150.webp)
![(Updated) DNS Provisioning Change [MC1048624] 3 pexels any lane 5945734](https://mwpro.co.uk/wp-content/uploads/2024/08/pexels-any-lane-5945734-150x150.webp)
![(Updated) Microsoft PowerPoint: "Reuse Slides" feature will retire starting 2026 [MC1179161] 4 road 6486701 1920](https://mwpro.co.uk/wp-content/uploads/2025/06/road-6486701_1920-150x150.webp)
![(Updated) Microsoft 365 Copilot: Create and view Outlook rules [MC1223821] 5 pexels asadphoto 1430675](https://mwpro.co.uk/wp-content/uploads/2024/08/pexels-asadphoto-1430675-150x150.webp)
