AI Gateway now supports spend limits — cost-based budgets that track cumulative dollar spend and block requests when the budget is exceeded. Unlike rate limiting, which caps the number of requests, spend limits track actual cost based on token usage and model pricing.
You can scope limits by model, provider, or custom metadata dimensions. For example, give each user a $200/day budget, cap total gateway spend at $10,000/day, or limit a specific model to $50/day per user. Each rule uses a configurable time window with fixed or sliding enforcement.
Spend limits work with both Unified Billing and BYOK requests for models with known pricing.
For more details, refer to the Spend limits documentation.
Source: Cloudflare
Latest Posts
- Power Platform admin center – Advanced connector policies [MC1403390]
![Power Platform admin center – Advanced connector policies [MC1403390] 2 pexels googledeepmind 17485678](data:image/svg+xml;base64,PHN2ZyB3aWR0aD0iMSIgaGVpZ2h0PSIxIiB4bWxucz0iaHR0cDovL3d3dy53My5vcmcvMjAwMC9zdmciPjwvc3ZnPg==)
- Amazon EC2 M8a instances now available in the Asia Pacific (Mumbai) region

- Amazon RDS Custom now supports the latest CU and GDR updates for Microsoft SQL Server

- Amazon EC2 C7a instances are now available in the Asia Pacific (Singapore) Region


![Power Platform admin center – Advanced connector policies [MC1403390] 2 pexels googledeepmind 17485678](https://mwpro.co.uk/wp-content/uploads/2024/08/pexels-googledeepmind-17485678-150x150.webp)



