What You’ll Find in This Section
- TrustGate vs Others A comparison of TrustGate’s performance against popular alternatives in the API gateway space. This includes latency, throughput, and memory footprint under similar workloads.
-
Local Setup for Benchmarking
Instructions on how to set up and run performance tests locally using common tools such as
wrk
,k6
, or custom scripts. Learn how to simulate real-world traffic, test different plugin combinations, and analyze results.
Why Performance Matters
- 🚀 Low Latency Inference For AI workloads, response time is critical. TrustGate minimizes overhead in request/response pipelines to keep latency predictable.
- 💡 Scalable Design Built for high concurrency, TrustGate efficiently handles multiple simultaneous requests across multiple services and providers.
- 📊 Observability-Friendly Metrics and plugin execution times are exposed and traceable, allowing for accurate performance profiling and bottleneck detection.
Recommended Usage
Use this section when you:- Need to compare TrustGate with your existing gateway or proxy
- Want to establish baseline latency for prompt moderation, rate limiting, or masking
- Are preparing for production load testing or traffic spikes
- Want to optimize performance based on measurable data
Continue to: