Organization Rate Limits
Throttle request and token volume per organization with configurable reset windows.
Organization rate limits protect the gateway and downstream providers from burst traffic originating from a specific team or entity.
Access
Navigate to Organization → select an org → Rate Limit.
Create a rate limit
Click + Create and configure:
| Field | Description |
|---|---|
| Token Max Limit | Maximum tokens allowed in the reset window |
| Token Reset Duration | How often token usage resets (e.g. 1 hour, 1 day) |
| Request Max Limit | Maximum API requests allowed in the reset window |
| Request Reset Duration | How often the request count resets (e.g. 1 hour, 1 day) |
| Governed Organization | The organization this limit applies to |
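To make the interaction between the two limits concrete, the following is a minimal sketch of a fixed-window check with separate token and request windows. It is an illustration only, not the gateway's actual enforcement logic: the names `Window`, `OrgRateLimit`, and `check`, the returned status strings, and the fixed-window reset behavior are all assumptions made for this example.

```python
from dataclasses import dataclass


@dataclass
class Window:
    """One fixed-window counter: a max limit plus a reset duration (hypothetical)."""
    max_limit: int
    reset_seconds: float
    used: int = 0
    window_start: float = 0.0

    def roll(self, now: float) -> None:
        # Start a fresh window once the reset duration has elapsed.
        if now - self.window_start >= self.reset_seconds:
            self.used = 0
            self.window_start = now

    def would_exceed(self, amount: int) -> bool:
        return self.used + amount > self.max_limit


@dataclass
class OrgRateLimit:
    """Mirrors the form fields above; not a real gateway type."""
    governed_organization: str
    tokens: Window      # Token Max Limit + Token Reset Duration
    requests: Window    # Request Max Limit + Request Reset Duration

    def check(self, token_cost: int, now: float) -> str:
        """Return "allowed", "token_limited", or "request_limited"."""
        self.tokens.roll(now)
        self.requests.roll(now)
        if self.requests.would_exceed(1):
            return "request_limited"
        if self.tokens.would_exceed(token_cost):
            return "token_limited"
        # Only count usage for requests that are actually let through.
        self.requests.used += 1
        self.tokens.used += token_cost
        return "allowed"


if __name__ == "__main__":
    # 1000 tokens per hour, 3 requests per minute, for a made-up org.
    limit = OrgRateLimit("acme", tokens=Window(1000, 3600), requests=Window(3, 60))
    print(limit.check(400, now=0.0))   # allowed
    print(limit.check(400, now=1.0))   # allowed
    print(limit.check(400, now=2.0))   # token_limited (800 + 400 > 1000)
    print(limit.check(100, now=3.0))   # allowed
    print(limit.check(50, now=4.0))    # request_limited (4th request in the minute)
    print(limit.check(50, now=61.0))   # allowed (request window has reset)
```

Because each limit has its own reset duration, the two windows roll independently: in the sketch, the request count resets after a minute while token usage keeps accumulating toward the hourly cap.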
Monitoring
After saving, open the Simulator and select a virtual key scoped to this organization. The Limits row on each governance hierarchy card shows request and token usage bars that update in real time.
When a limit is exceeded, the Simulator displays a Rate Limited, Token Limited, or Request Limited alert on the blocked response.
Related
- Rate Limit — tenant-wide rate limit policies
- User Keys — virtual-key-level limits