Routing

Guardway Gateway decides which provider handles each request using a combination of routing strategies and routing rules. Configure these from the dashboard’s Configuration → Routing page.

Routing strategies

Strategy	Behavior
Priority	Match rules in priority order. Default.
Lowest-latency	Route to the fastest provider based on historical metrics.
Lowest-cost	Route to the cheapest provider for the requested model.
Least-busy	Route to the provider with the lowest concurrent load.
Tag-based	Route based on tags attached to the request.
Auto-router	Adaptive routing based on SLA requirements.

Routing rules

Rules use pattern matching against the model name in the request:

Pattern types: equals, startsWith, contains.
Priority: lower number = higher priority.
Budget limits: optional per-rule USD budget with a time window (hour, day, month). Windows auto-reset.
Token limits: optional per-rule token cap with a time window.
Fallback strategies: next-priority, lowest-cost, lowest-latency, fail.

Rules are evaluated top-to-bottom by priority until one matches. The first match wins; if it fails and has a fallback, the fallback chain runs.

Example

A rule that sends every gpt-4o* request to OpenAI, falls back to the cheapest available provider if OpenAI is down, and caps the team at $100/day:

Field	Value
Pattern	`startsWith: gpt-4o`
Priority	`10`
Target provider	OpenAI
Budget	`$100 / day`
Fallback	`lowest-cost`

Getting Started

Platform

Gateway

Discovery

Resources

API Reference

Routing strategies

Routing rules

Example

Getting Started

Platform

Gateway

Discovery

Resources

API Reference

​Routing strategies

​Routing rules

​Example

Routing strategies

Routing rules

Example