Organization Rate Limits

Throttle request and token volume per organization with configurable reset windows.

Organization rate limits protect the gateway and downstream providers from burst traffic originating from a specific team or entity.

Access

Navigate to Organization → select an org → Rate Limit.

Create a rate limit

Click + Create and configure:

FieldDescription
Token Max LimitMaximum tokens allowed in the reset window
Token Reset DurationHow often token usage resets (e.g. 1 hour, 1 day)
Request Max LimitMaximum API requests allowed in the reset window
Request Reset DurationHow often request count resets
Governed OrganizationThe organization this limit applies to

Monitoring

After saving, open the Simulator and select a virtual key scoped to this organization. The Limits row on each governance hierarchy card shows request and token usage bars updating in real time.

When a limit is exceeded, the simulator displays a Rate Limited, Token Limited, or Request Limited alert on the blocked response.