Posted inAmazon Web Services
Amazon Bedrock now supports observability of First Token Latency and Quota Consumption
Amazon Bedrock is a fully managed service for building generative AI applications using high-performing foundation models from leading AI providers. It now supports two new CloudWatch metrics, TimeToFirstToken and EstimatedTPMQuotaUsage, giving you deeper visibility into inference performance and quota consumption.…





