Load Balancing
Round-Robin
Round-robin is a straightforward load balancing strategy in which incoming requests are distributed sequentially among a pool of backend targets. This ensures an even rotation of requests across available services, helping balance the load and improving utilization.
- Key Benefit: Simple and effective way to spread traffic evenly.
- Weighted Support: By assigning different
weight
values to each target, you can influence how often a target is chosen within the round-robin sequence. - Health Checks: If health checks are enabled, any unhealthy target is removed from the rotation until it becomes healthy again, ensuring that only valid targets serve requests.
Create an Upstream with Round-Robin Strategy
Below is an example command to create an Upstream using the round-robin algorithm. The sample request includes two targets—one for OpenAI and another for Anthropic—each with its own path, provider, credentials, and optional model configurations.