Observability & Monitoring
Prometheus Integration
This guide outlines how to integrate TrustGate with Prometheus for metrics scraping and Grafana for observability dashboards.
Overview
TrustGate exposes a /metrics
endpoint compatible with Prometheus. You can use this to collect metrics such as request throughput, latency distributions, and service health indicators. These metrics can then be visualized via Grafana dashboards.
Prometheus Configuration
To enable metrics collection from TrustGate, add the following scrape_config
section to your Prometheus configuration file (prometheus.yml
):
Grafana Dashboard
Create a Grafana dashboard to visualize key metrics:
- Request Overview
- Total requests by status code
- Request rate over time
- Active connections
- Latency Metrics
- Overall request latency (p50, p90, p99)
- Service-specific latency
- Upstream latency distribution
- Service Health
- Success rate by service
- Error rate by route
- Connection status
Example PromQL Queries
Request Rate
95th Percentile Latency
Error Rate
Active Connections
Best Practices
- Alert Configuration
- Set up alerts for high error rates
- Monitor latency thresholds
- Track connection limits
- Watch for request spikes
- Dashboard Organization
- Group related metrics
- Use appropriate time ranges
- Include service-level views
- Add error tracking panels
- Metric Collection
- Set appropriate scrape intervals
- Configure retention periods
- Monitor metric cardinality
- Use label aggregation
- Performance Monitoring
- Track latency trends
- Monitor resource usage
- Watch for bottlenecks
- Analyze traffic patterns
Troubleshooting
Common monitoring issues and solutions:
- High Latency
- Check upstream service latency
- Review connection pooling
- Monitor resource usage
- Analyze request patterns
- Error Spikes
- Check service health
- Review error logs
- Monitor rate limits
- Verify configurations
- Connection Issues
- Check network connectivity
- Review connection limits
- Monitor timeout settings
- Verify DNS resolution
Next Steps
- Set up Prometheus and Grafana
- Configure alerting rules
- Create custom dashboards
- Implement logging integration