Round-robin is a straightforward load balancing strategy in which incoming requests are distributed sequentially among a pool of backend targets. This ensures an even rotation of requests across available services, helping balance the load and improving utilization.
Key Benefit: Simple and effective way to spread traffic evenly.
Weighted Support: By assigning different weight values to each target, you can influence how often a target is chosen within the round-robin sequence.
Health Checks: If health checks are enabled, any unhealthy target is removed from the rotation until it becomes healthy again, ensuring that only valid targets serve requests.
Below is an example command to create an Upstream using the round-robin algorithm. The sample request includes two targets—one for OpenAI and another for Anthropic—each with its own path, provider, credentials, and optional model configurations.